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PREFACE 


In 1992 after finishing my book, “The Transition of Chaos,” I realized that I 
needed to write a new edition of “A Modem Course in Statistical Physics”. I 
wanted to adjust the material to better prepare students for what I believe are the 
current directions of statistical physics. I wanted to place more emphasis on 
nonequilibrium processes and on the thermodynamics underlying biological 
processes. I also wanted to be more complete in the presentation of material. It 
turned out to be a greater task than I had anticipated, and now five years later I 
am finally finishing the second edition. One reason it has taken so long is that I 
have created a detailed solution manual for the second edition and I have added 
many worked out exercises to the text. In this way I hope I have made the second 
edition much more student and instructor friendly than the first edition was. 

There are two individuals who have had a particularly large influence on this 
book and whom I want to thank, even though they took no part in writing the 
book. (Any negative features of this book are totally my responsibility.) The 
biggest influence has been Ilya Prigogine, and for that reason I have dedicated 
this book to him. When I first came to the University of Texas to join the 
Physics faculty, I became a member of what was then the Center for 
Thermodynamics and Statistical Mechanics (now known as the Prigogine 
Center for Statistical Mechanics and Complex Systems). My training was in 
equilibrium statistical mechanics. But that changed when I learned that the 
focus of this unique research Center, deep in the heart of Texas, was on 
nonequilibrium nonlinear phenomena, most of it far from equilibrium. I began 
to work on nonequilibrium and nonlinear phenomena, but followed my own 
path. The opportunity to teach and work in this marvelous research center and 
to listen to the inspiring lectures of Ilya Prigogine and lectures of the many 
visitors to the Center has opened new worlds to me, some of which I have tried 
to bring to students through this book. 

The other individual who has had a large influence on this book is Nico van 
Kampen, a sometimes visitor to the University of Texas. His beautiful lectures on 
stochastic processes were an inspiration and spurred my interest in the subject. 

I want to thank the many students in my statistical mechanics classes who 
helped me shape the material for this book and who also helped me correct the 
manuscript. 

This book covers a huge range of material. I could not reference all the work 
by the individuals who have contributed in all these areas. I have referenced 
work which most influenced my view of the subject and which could lead 
students to other related work. I apologize to those whose work I have not been 
able to include in this book. 

L. E. Reichl 

Austin, Texas 
September 1997 




1 

INTRODUCTION 


l.A. OVERVIEW 

The field of statistical physics has expanded dramatically in recent years. New 
results in ergodic theory, nonlinear chemical physics, stochastic theory, 
quantum fluids, critical phenomena, hydrodynamics, transport theory, and 
biophysics have revolutionized the subject, and yet these results are rarely 
presented in a form that students who have little background in statistical 
physics can appreciate or understand. This book has been written in an effort to 
incorporate these subjects into a basic course on statistical physics. It includes 
in a unified and integrated manner the foundations of statistical physics and 
develops from them most of the tools needed to understand the concepts 
underlying modem research in all of the above fields. 

In the field of ergodic theory, for example, chaos theory has deepened our 
understanding of the structure and dynamical behavior of a variety of nonlinear 
systems and has made ergodic theory a modem field of research. Indeed, one of 
the frontiers of science today is the study of the spectral properties of decay 
processes in nature, based on the chaotic nature of the underlying dynamics of 
those systems. Advances in this field have been aided by the development of 
ever more powerful computers. In an effort to introduce this field to students, a 
careful discussion is given of the behavior of probability flows in phase space, 
including specific examples of ergodic and mixing flows. 

Nonlinear chemical physics is still in its infancy, but it has already given a 
conceptual framework within which we can understand the thermodynamic 
origin of life processes. The discovery of dissipative structures (nonlinear 
spatial and temporal structures) in nonlinear nonequilibrium chemical systems 
has opened a new field in chemistry and biophysics. In this book, material has 
been included on chemical thermodynamics, chemical hydrodynamics, and 
nonequilibrium phase transitions in chemical and hydrodynamic systems. 

The use of stochastic theory to study fluctuation phenomena in chemical and 
hydrodynamic systems, along with its growing use in population dynamics and 
complex systems theory, has brought new life to this field. The discovery of 
scaling behavior at all levels of the physical world, along with the appearance of 
Levy flights which often accompanies scaling behavior, has forced us to think 




2 


INTRODUCTION 


beyond the limits of the Central Limit Theorem. In order to give students some 
familiarity with modem concepts from the field of stochastic theory, we have 
placed probability theory in a more general framework and discuss, within that 
framework, classical random walks, Levy flights, and Brownian motion. 

The theory of superfluids rarely appears in general textbooks on statistical 
physics, but the theory of such systems is incorporated at appropriate places 
throughout this book. We discuss the thermodynamic properties of superfluid 
and superconducting systems, the Ginzburg-Landau theory of superconductors, 
the BCS theory of superconductors, and superfluid hydrodynamics. Also 
included in the book is an extensive discussion of properties of classical fluids 
and their thermodynamic and hydrodynamic properties. 

The theory of phase transitions has undergone a revolution in recent years. In 
this book we define critical exponents and use renormalization theory to 
compute them. We also derive an exact expression for the specific heat of the 
two-dimensional Ising system, one of the simplest exactly solvable systems 
which can exhibit a phase transition. At the end of the book we include an 
introduction to the theory of nonequilibrium phase transitions. 

Hydrodynamics is a very powerful tool for understanding long-wavelength 
phenomena in classical fluids, solids, liquid crystals, superfluids, and biological 
systems. This book contains a thorough grounding in hydrodynamics based 
on the underlying symmetries and stability properties of matter. We discuss 
properties of correlation functions, causality, the fluctuation-dissipation 
theorem, the theory of light scattering, and the origin of hydrodynamics in 
terms of conserved quantities and broken symmetries. We also include a 
variety of applications of the hydrodynamics of mixtures, a subject essential for 
biophysics. 

Transport theory is discussed from many points of view. We derive Onsager’s 
relations for transport coefficients. We derive expressions for transport 
coefficients based on simple “back of the envelope” mean free path arguments. 
The Boltzmann and Lorentz-Boltzmann equations are derived and microscopic 
expressions for transport coefficients are obtained, starting from spectral 
properties of the Boltzmann and Lorentz-Boltzmann collision operators. The 
difficulties in developing a convergent transport theory for dense gases are also 
reviewed. 

Concepts developed in statistical physics underlie all of physics. Once the 
forces between microscopic particles are determined, statistical physics gives us 
a picture of how microscopic particles act in the aggregate to form the 
macroscopic world. As we see in this book, what happens on the macroscopic 
scale is sometimes surprising. 


I.B. PLAN OF BOOK 

Thermodynamics is a consequence and a reflection of the symmetries of nature. 
It is what remains after collisions between the many degrees of freedom of 



PLAN OF BOOK 


3 


macroscopic systems randomize and destroy most of the coherent behavior. The 
quantities which cannot be destroyed, due to underlying symmetries of nature 
and their resulting conservation laws, give rise to the state variables upon which 
the theory of thermodynamics is built. Thermodynamics is therefore a solid and 
sure foundation upon which we can construct theories of matter out of 
equilibrium. That is why we place heavy emphasis on it in this book. 

The book is divided into four parts. Chapters 2 and 3 present the foundations 
of thermodynamics and the thermodynamics of phase transitions. Chapters 4 
through 6 present probability theory, stochastic theory, and the foundations of 
statistical mechanics. Chapters 7 through 9 present equilibrium statistical 
mechanics, with emphasis on phase transitions and the equilibrium theory of 
classical fluids. Chapters 10 through 12 deal with nonequilibrium processes, 
both on the microscopic and macroscopic scales, both near and far from 
equilibrium. The first two parts of the book essentially lay the foundations for 
the last two parts. 

There seems to be a tendency in many books to focus on equilibrium 
statistical mechanics and derive thermodynamics as a consequence. As a result, 
students do not get the experience of traversing the vast world of 
thermodynamics and do not understand how to apply it to systems which are 
too complicated for statistical mechanics. For this reason, we begin the book 
with a thorough grounding in thermodynamics. In Chapter 2 we review the 
foundations of thermodynamics and thermodynamic stability theory and devote 
a large part of the chapter to a variety of applications which do not involve 
phase transitions, such as the cooling of gases, mixing, osmosis, and chemical 
thermodynamics. Chapter 3 is devoted to the thermodynamics of phase 
transitions and the use of thermodynamic stability theory in analyzing these 
phase transitions. We discuss first-order phase transitions in liquid-vapor-solid 
transitions, with particular emphasis on the liquid-vapor transition and its 
critical point and critical exponents. We also introduce the Ginzburg-Landau 
theory of continuous phase transitions and discuss a variety of transitions which 
involve broken symmetries. 

Having developed some intuition concerning the macroscopic behavior of 
complex equilibrium systems, we then turn to microscopic foundations. 
Chapters 4 through 6 are devoted to probability theory and the foundations of 
statistical mechanics. Chapter 4 contains a review of basic concepts from 
probability theory and then uses these concepts to describe classical random 
walks and Levy flights. The Central Limit Theorem and the breakdown of the 
Central Limit Theorem for scaling processes is described. In Chapter 5 we 
study the dynamics of discrete stochastic variables based on the master 
equation. We also introduce the theory of Brownian motion and the idea of 
separation of time scales, which has proven so important in describing 
nonequilibrium phase transitions. The theory developed in Chapter 5 has many 
applications in chemical physics, laser physics, population dynamics, and 
biophysics, and it prepares the way for more complicated topics in statistical 
mechanics. 
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Chapter 6 lays the probabilistic foundations of statistical mechanics, starting 
from ergodic theory. In recent years, there has been a tendency to sidestep this 
aspect of statistical physics completely and to introduce statistical mechanics 
using information theory. The student then misses one of the current frontiers of 
modem physics, the study of the spectral behavior of decay processes in nature, 
based on the chaotic nature of the underlying dynamics of those systems. While 
we cannot go very far into this subject in this book, we at least discuss the 
issues. We begin by deriving the Liouville equation, which is the equation of 
motion for probability densities, both in classical mechanics and in quantum 
mechanics. We look at the types of flow that can occur in mechanical systems 
and introduce the concepts of ergodic and mixing flows, which appear to be 
minimum requirements if a system is to decay to thermodynamic equilibrium. 

Chapters 7-9 are devoted entirely to equilibrium statistical mechanics. In 
Chapter 7 we derive the probability densities (the microcanonical, canonical, 
and grand canonical ensembles) for both closed and opened systems and relate 
them to thermodynamic quantities and the theory of fluctuations. We then use 
them to derive the thermodynamic properties of a variety of model systems, 
including harmonic lattices, spin systems, ideal quantum gases, and super- 
conductors. 

In Chapter 8 we introduce the equilibrium fluctuation theory of spin systems 
and show qualitatively how the spatial extent of correlations between fluctua- 
tions diverges as we approach the critical point. We also introduce the idea of 
scaling and use renormalization theory to obtain microscopic expressions for 
the critical exponents of spin lattices. Finally we conclude Chapter 8 by 
obtaining an exact expression for the heat capacity of the two-dimensional Ising 
lattice, and we compare our exact expressions to those of mean field theory. 

Chapter 9 is devoted to the equilibrium theory of classical fluids. In this 
chapter we relate the thermodynamic properties of classical fluids to the 
underlying radial distribution function, and we use the Ursell-Mayer cluster 
expansion to obtain a virial expansion of the the equation of state of a classical 
fluid. We also discuss how to include quantum corrections for nondegenerate 
gases. 

The last part of the book, Chapters 10-12, deals with nonequilibrium 
processes. Chapter 10 is devoted to hydrodynamic processes for systems near 
equilibrium. We begin by deriving the Navier-Stokes equations from the 
symmetry properties of a fluid of point particles, and we use the derived 
expression for entropy production to obtain the transport coefficients for the 
system. We use the solutions of the linearized Navier-Stokes equations to 
predict the outcome of light-scattering experiments. We go on to derive 
Onsager’s relations between transport coefficients, and we use causality to 
derive the fluctuation-dissipation theorem. We also derive a general expression 
for the entropy production in systems with mixtures of particles which can 
undergo chemical reactions. We then use this theory to describe thermal and 
chemical transport processes in mixtures, across membranes, and in electrical 
circuits. The hydrodynamic equations describe the behavior of just a few slowly 



USE AS A TEXTBOOK 


5 


varying degrees of freedom in fluid systems. If we assume that the remainder of 
the fluid can be treated as a background noise, we can use the fluctuation- 
dissipation theorem to derive the correlation functions for this background 
noise. In Chapter 10 we also consider hydrodynamic modes which result from 
broken symmetries, and we derive hydrodynamic equations for superfluids and 
consider the types of sound that can exist in such fluids. 

In Chapter 11 we derive microscopic expressions for the coefficients of 
diffusion, shear viscosity, and thermal conductivity, starting both from mean free 
path arguments and from the Boltzmann and Lorentz-Boltzmann equations. In 
deriving microscopic expressions for the transport coefficients from the 
Boltzmann and Lorentz-Boltzmann equations, we use a very elegant method 
which relies on use of the eigenvalues and eigenfunctions of the collision 
operators associated with those equations. We obtain explicit microscopic 
expressions for the transport coefficients of a hard sphere gas. 

Finally, in Chapter 12 we conclude with the fascinating subject of 
nonequilibrium phase transitions. We discuss thermodynamic stability theory 
for systems far from equilibrium. We also show how nonlinearities in the rate 
equations for chemical reaction-diffusion systems lead to nonequilibrium phase 
transitions which give rise to chemical clocks, nonlinear chemical waves, and 
spatially periodic chemical structures, while nonlinearities in the Rayleigh- 
Benard hydrodynamic system lead to spatially periodic convection cells. We 
shall also examine the nature of fluctuations in the neighborhood of the critical 
point for these transitions and show that they are characterized by a critical 
slowing down of certain unstable modes. 


l.C. USE AS A TEXTBOOK 

Even though this book contains a huge amount of material, it has been 
designed to be used as a textbook. In each chapter the material has been divided 
into core topics and special topics. The core topics provide key basic material in 
each chapter, while special topics illustrate these core ideas with a variety of 
applications. The instructor can select topics from the special topics sections, 
according to the emphasis he/she wishes to give the course. 

In many sections, we have included nontrivial demonstration exercises to 
help the students understand the material and to help in solving homework 
problems. Each chapter has a variety of problems at the end of the chapter that 
can be used to help the students test their understanding. 

Even if one covers only the core topics of each chapter, there may be too 
much material to cover in a one-semester course. However, the book is designed 
so that some chapters may be omitted completely. The choice of which chapters 
to use depends on the interests of the instructor. Our suggestion for a basic 
well-rounded one-semester course in statistical physics is to cover the core 
topics in Chapters 2, 3, 4, 7, 10, and 11 (only Section ll.B if time is running 
short). 
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The book is intended to introduce the students to a variety of subjects and 
resource materials which they can then pursue in greater depth if they wish. We 
have tried to use standardized notation as much as possible. In writing a book 
which surveys the entire field of statistical physics, it is impossible to include or 
even to reference everyone’s work. We have included references which were 
especially pertinent to the points of view we take in this book and which will 
lead students easily to other work in the same field. 



PART ONE 


THERMODYNAMICS 
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INTRODUCTION TO 
THERMODYNAMICS 


2.A. INTRODUCTORY REMARKS 

The science of thermodynamics began with the observation that matter in the 
aggregate can exist in macroscopic states which are stable and do not change in 
time. These stable “equilibrium” states are characterized by definite 
mechanical properties, such as color, size, and texture, which change as the 
substance becomes hotter or colder (changes its temperature). However, any 
given equilibrium state can always be reproduced by bringing the substance 
back to the same state. Once a system reaches its equilibrium state, all changes 
cease and the system will remain forever in that state unless some external 
influence acts to change it. This inherent stability and reproducibility of the 
equilibrium states can be seen everywhere in the world around us. 

Thermodynamics has been able to describe, with remarkable accuracy, the 
macroscopic behavior of a huge variety of systems over the entire range of 
experimentally accessible temperatures (10 -4 K to 10 6 K). It provides a truly 
universal theory of matter in the aggregate. And yet, the entire subject is based 
on only four laws, which may be stated rather simply as follows: Zeroth Law — 
it is possible to build a thermometer; First Law — energy is conserved; Second 
Law — not all heat energy can be converted into work; and Third Law — we can 
never reach the coldest temperature using a finite set of reversible steps. 
However, even though these laws sound rather simple, their implications are 
vast and give us important tools for studying the behavior and stability of 
systems in equilibrium and, in some cases, of systems far from equilibrium. 

The core topics in this chapter focus on a review of various aspects of 
thermodynamics that will be used throughout the remainder of the book. The 
special topics at the end of this chapter give a more detailed discussion of some 
applications of thermodynamics which do not involve phase transitions. Phase 
transitions will be studied in Chapter 3. 

We shall begin this chapter by introducing the variables which are used in 
thermodynamics and the mathematics needed to calculate changes in the 
thermodynamic state of a system. As we shall see, many different sets of 
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mechanical variables can be used to describe thermodynamic systems. In order 
to become familiar with some of these mechanical variables, we shall write the 
experimentally observed equations of state for a variety of thermodynamic 
systems. 

As we have mentioned above, thermodynamics is based on four laws. We 
shall discuss the content of these laws in some detail, with particular emphasis 
on the second law. The second law is extremely important both in equilibrium 
and out of equilibrium because it gives us a criterion for testing the stability of 
equilibrium systems and, in some cases, nonequilibrium systems. 

There are a number of different thermodynamic potentials that can be used to 
describe the behavior and stability of thermodynamic systems, depending on 
the type of constraints imposed on the system. For a system which is isolated 
from the world, the internal energy will be a minimum for the equilibrium state. 
However, if we couple the system thermally, mechanically, or chemically to the 
outside world, other thermodynamic potentials will be minimized. We will intro- 
duce the five most commonly used thermodynamic potentials (internal energy, 
enthalpy, Helmholtz free energy, Gibbs free energy, and the grand potential), 
and we will discuss the conditions under which each one is minimized at equili- 
brium and why they are called potentials. 

When experiments are performed on thermodynamic systems, the quantities 
which are easiest to measure are the response functions. Generally, we change 
one parameter in the system and see how another parameter responds to that 
change, under highly controlled conditions. The quantity that measures the way 
in which the system responds is called a response function. In this chapter we 
shall introduce a variety of thermal and mechanical response functions and give 
relations between them. 

Isolated equilibrium systems are systems in a state of maximum entropy. 
Any fluctuations which occur in such systems must cause a decrease in entropy 
if the equilibrium state is to be stable. We can use this fact to find relations 
between the intensive state variables for different parts of a system if those parts 
are to be in mechanical, thermal, and chemical equilibrium. In addition, we can 
find restrictions on the sign of the response functions which must be satisfied for 
stable equilibrium. We shall find these conditions and discuss the restrictions 
they place on the Helmholtz and Gibbs free energy. 

Thermodynamics becomes most interesting when it is applied to real 
systems. In order to demonstrate its versatility, in the section on special topics, 
we shall apply it to a number of systems which have been selected for their 
practical importance or conceptual importance. 

We begin with a subject of great practical and historic importance, namely, 
the cooling of gases, it is often necessary to cool substances below the 
temperature of their surroundings. The refrigerator most commonly used for 
this purpose is based on the Joule-Kelvin effect. There are two important ways 
to cool gases. We can let them do work against their own intermolecular forces 
by letting them expand freely (Joule effect); or we can force them through a 
small constriction, thus causing cooling at low temperatures or heating at high 
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temperatures (Joule-Kelvin effect). The Joule-Kelvin effect is by far the more 
effective of the two methods. We shall discuss both methods in this chapter and 
use the van der Waals equation of state to obtain estimates of the cooiling 
effects for some real gases. 

For reversible processes, changes in entropy content can be completely 
accounted for in terms of changes in heat content. For irreversible processes, 
this is no longer true. We can have entropy increase in an isolated system, even 
though no heat has been added. Therefore, it is often useful to think of an 
increase in entropy as being related to an increase in disorder in a system. One 
of the most convincing illustrations of this is the entropy change which occurs 
when two substances, which have the same temperature and pressure but 
different identities, are mixed. Thermodynamics predicts that the entropy will 
increase solely due to mixing of the substances. 

When the entropy of a system changes due to mixing, so will other thermo- 
dynamic quantities. One of the most interesting examples of this is osmosis. We 
can fill a container with water and separate it into two parts by a membrane 
permeable to water but not salt, for example. If we put a small amount of salt 
into one side, the pressure of the resulting salt solution will increase markedly 
because of mixing. 

Chemical reactions can be characterized in a rather simple way in terms of a 
thermodynamic quantity called the affinity. The affinity gives a measure of the 
distance of a chemical reaction from thermodynamic equilibrium and will be 
useful in later chapters when we discuss chemical systems out of equilibrium. 
We can obtain an expression for the affinity by using the conditions for 
thermodynamic equilibrium and stability introduced in Chapter 2, and at the 
same time we can learn a number of interesting facts about the thermodynamic 
behavior of chemical reactions. A special example of a type of reaction 
important to biological systems is found in electrolytes, which consist of salts 
which can dissociate but maintain an electrically neutral solution. 


2.B. STATE VARIABLES AND EXACT DIFFERENTIALS 

Thermodynamics describes the behavior of systems with many degrees of 
freedom after they have reached a state of thermal equilibrium — a state in 
which all past history is forgotten and all macroscopic quantities cease to 
change in time. The amazing feature of such systems is that, even though they 
contain many degrees of freedom (~ 10 23 ) in chaotic motion, their thermo- 
dynamic state can be specified completely in terms of a few parameters — called 
state variables. In general, there are many state variables which can be used to 
specify the thermodynamic state of a system, but only a few (usually two or 
three) are independent. In practice, one chooses state variables which are 
accessible to experiment and obtains relations between them. Then, the 
“machinery” of thermodynamics enables one to obtain the values of any other 
state variables of interest. 
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State variables may be either extensive or intensive. Extensive variables 
always change in value when the size (spatial extent and number of degrees of 
freedom) of the system is changed, and intensive variables need not. Certain 
pairs of intensive and extensive state variables often occur together because 
they correspond to generalized forces and displacements which appear in 
expressions for thermodynamic work. Some examples of such extensive and 
intensive pairs are, respectively, volume, V, and pressure, P; magnetization, M, 
and magnetic field strength, H; length, L, and tension, J; area, A, and surface 
tension, a; electric polarization, P, and electric field, E. The pair of state 
variables related to heat content of thermodynamic system are the temperature, 
T, which is intensive, and the entropy, S, which is extensive. There is also a pair 
of state variables associated with “chemical” properties of a system. They are 
the number of particles, N, which is extensive, and the chemical potential per 
particle, // , which is intensive. In this book we shall sometimes use the number 
of moles, n, and the chemical potential per mole, /i (molar chemical potential); 
or the mass of a substance, M, and the chemical potential per unit mass, /i, 
(specific chemical potential), as the chemical state variables. If there is more 
than one type of particle in the system, then there will be a* mole number and 
chemical potential associated with each type of particle. 

Other state variables used to describe the thermodynamic behavior of a 
system are the various response functions, such as heat capacity, C; compres- 
sibility, /t; magnetic susceptibility, and various thermodynamic potentials, 
such as the internal energy, U; enthalpy, H ; Helmholtz free energy, A; Gibbs 
free energy, G; and the grand potential, H. We shall become thoroughly 
acquainted with these state variables in subsequent sections. 

If we change the thermodynamic state of our system, the amount by which 
the state variables change must be independent of the path taken. If this were 
not so, the state variables would contain information about the history of the 
system. It is precisely this property of state variables which makes them so 
useful in studying changes in the equilibrium state of various systems. Mathe- 
matically changes in state variables correspond to exact differentials [ 1 ]; 
therefore, before we begin our discussion of thermodynamics, it is useful to 
review the theory of exact differentials. This will be the subject of the 
remainder of this section. 

Given a function F = F(x 1,^2) depending on two independent variables x\ 
and *2, the differential of F is defined as follows: 

^=(£)^ ,+ (£) Ji ^ (2 - !) 

where {dF/d x\ ) is the derivative of F with respect to x\ holding *2 fixed. If F 
and its derivatives are continuous and 
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then dF is an exact differential. If we denote 


dF\ 


( dF\ 


c i(*i,* 2 ) = ( -5— and c 2 (x u x 2 ) s ( — , 


dxij 


\dx 2 ) 


then the variables c\ and x\ and variables c 2 and x 2 are called “conjugate” 
variables with respect to the function F. 

The fact that dF is exact has the following consequences: 


(a) The value of the integral 


F(B) - F(A) = 


» B (■ B 

dF = j (c\dxi T c 2 dx 2 ) 

A JA 


is independent of the path taken between A and B and depends only on 
the end points A and B. 

(b) The integral of dF around a closed path is zero: 


(b dF = <p (c\dx\ + c 2 dx 2 ) = 0. 

J closed J closed 

(c) If one knows only the differential dF, then the function F can be found to 
within an additive constant. 


If F depends on more than two variables, then the statements given above 
generalize in a simple way: Let F = F(x\,x 2 , . . . ,x n ), then the differential, dF, 
may be written 



(2.3) 


The notation (dF/dxfi^ ^ means that the derivative of F is taken with respect 
to xi holding all variables but x, constant. For any pair of variables, the 
following relation holds: 


An example for the case of three independent variables is 


d_fdF\ 

dx > WJfe.) 




d_(dF\ 

[dx k \dx,) Ui/ij 


(2.4) 


{*//*} 


dF = c\dx\ + c 2 dx 2 + c 3 dx 3. 
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The Eq. (2.4) leads to the result 

( dc '\ _ ( dci \ ( dc] \ = ( dc A 

\ dx 2j Xux ~ \ dx J Xu x2 \8*i / 

(dc2\ = (dcA 

\ dx J X2 , X] \ dx 2j X3 , Xl ' 


Differentials of all state variables are exact and have the above properties. 
Given four state variables, jc, y, z, and w, where w is a function of any two of the 
variables jc, y, or z, one can obtain the following useful relations along paths for 
which F(. jc, y, z) = 0 : 





/3x\ f dy\ 

/d: c\ f &c\ / dw\ 


(2.5) 

(2.6) 

(2.7) 

(2.8) 


It is a simple matter to derive Eqs. (2.5)-(2.8). We will first consider Eqs. (2.5) 
and (2.6). Let us choose variables y and z to be independent, x = Jc(y,z), and 
then choose jc and z to be independent, y = y(x, z), and write the following 
differentials; dx = (dx/dy) z dy + (dx/dz) y dz and dy = (dy/dx) z dx 4- ( dy / 
dz) x dz- 

If we eliminate dy between these equations, we obtain 




dx 4- 


/ dx 

\dy. 




dz = 0. 


Because dx and dz may be varied independently, their coefficients may be set 
equal to zero separately. The result is Eqs. (2.5) and (2.6). 

To derive Eq. (2.7) we let y and z be independent so that jc = jc(y, z) and write 
the differential for dx. If we then divide by dw, we obtain 


djc _ / &c\ dy f chr\ dz 
dw \dyj z dw \dzj y dw' 


For constant z, dz = 0 and we find Eq. (2.7) 
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Finally, to derive Eq. (2.8) we let x be a function of y and w, x = x(y , w). If 
we write the differential of x, divide it by dy, and restrict the entire equation to 
constant z, we obtain Eq. (2.8). 

When integrating exact differentials, one must be careful not to lose terms. In 
Exercise 2.1, we illustrate two different methods for integrating exact 
differentials. 


■ EXERCISE 2.1. Consider the differential dcf) = (x 2 4 y)dx + x dy. (a) 
Show that it is an exact differential, (b) Integrate d(f) between the points A 
and B in the figure below, along the two different paths, 1 and 2. (c) Integrate 
dcf) between points A and B using indefinite integrals. 



Answer: 


(a) From the expression dcf) = (x? -F y)dx 4- xdy, we can write 
(d(f)/dx) = x 1 +y and (d<p/dy) x = x. Since [(d/dy)(d<j>/dx) y ] x = 
[{d/dx)(d <f>/dy) x ] y = 1 , the differential, dcj), is exact. 

(b) Let us first integrate the differential, dcf), along path 1: 


4 >B — <j>. A = 


J yB 

x B dy 

yA 

= 5-4 + x B y B - ±4 - 



Let us next integrate the differential, dcf), along path 2. Note that along 
path 2, y = y A + (Ay/Ax)(x - x A ), where Ay = y B - y A and 
Ax — x B — x A . If we substitute this into the expression for dcj>, we 
find dcf) = (x 2 4- y)dx 4- xdy = [ jc 2 4- yA 4- ( Ay/Ax)(lx - x A )\dx. 
Therefore 


- 0a = J dx x 2 +y A A^(2x-x A )j 

3 x B Wfi — 3 x A x A y A . 


( 2 ) 


Note that the change in cf) in going from point A to point B is 
independent of the path taken. This is a property of exact differentials. 
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(c) We now integrate the differential, d(f>, in a different way. Let us first 
do the indefinite integrals 

dx = ! (x 2 + y)dx = I* 3 + xy + K\ (y), (3) 


where ATi (y) is an unknown function of y. Next do the integral 

| (fy) dy = \ xd y = x y + K ^ x ) ’ ( 4 ) 

where Ki{x) is an unknown function of x. In order for Eqs. (3) and (4) 
to be consistent, we must choose Ki{x) = \ x 3 4- K 3 and K\ (y) = K 3 , 
where K 3 is a constant. Therefore, 0 = \x 7> 4-xy 4- K 3 and again, 
(f>B - 04 = 4- X B yB - \x\ - X A yA- 



2.C. SOME MECHANICAL EQUATIONS OF STATE 

An equation of state is a functional relation between the state variables for a 
system in equilibrium which reduces the number of independent degrees of 
freedom needed to describe the state of the system. It is an equation which 
relates the thermal state variables, T or S, to the mechanical and chemical state 
variables for that system and contains a great deal of information about the 
thermodynamic behavior of the system. It is useful to give some examples of 
empirically obtained equations of state. 

2.C.I. Ideal Gas Law 

The best-known equation of state is the ideal gas law 

PV = nRT, (2.9) 

where n is the number of moles, T is temperature in degrees Kelvin, P is the 
pressure in Pascals, V is the volume in cubic meters, and R = 8.314 J/mol- K is 
the universal gas constant. The ideal gas law gives a good description of a gas 
which is so dilute that the effect of interaction between particles can be 
neglected. 

If there are m different types of particles in the gas, then the ideal gas law 
takes the form 

m 

pv = Yl n ‘ RT ’ < 2 - 10 ) 

f=i 

where n, is the number of moles of the ith constituent. 
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2.C.2. Virial Expansion [2] 

The virial expansion, 


P = 



l + ^B 2 (T) + (^yB 3 (T) 


+ 


(2.11) 


expresses the equation of state of a gas as a density expansion. The quantities 
B 2 (T) and B${T) are called the second and third virial coefficients and are 
functions of temperature only. As we shall see in Chapter 9, the virial 
coefficients may be computed in terms of the interparticle potential. 
Comparison between experimental and theoretical values for the virial 
coefficients is an important method for obtaining the force constants for 
various interparticle potentials. In Fig. 2.1 we have plotted the second virial 
coefficient for helium and argon. The curves are typical of most gases. At low 
temperatures, B 2 (T) is negative because the kinetic energy is small and the 
attractive forces between particles reduce the pressure. At high temperatures the 
attractive forces have little effect and corrections to the pressure become 
positive. At high temperature the second virial coefficient has a maximum. 

For an ideal classical gas all virial coefficients, £,(/ > 2), are zero, but for an 
ideal quantum gas (Bose-Einstein or Fermi-Dirac) the virial coefficients are 



Fig. 2.1. A plot of the second virial coefficients for helium and argon in terms of the 
dimensionless quantities, B* = B 2 j bo and T* = k B T/e , where bo and e are constants, 
k B is Boltzmann’s constant, and T is the temperature. For helium, bo — 21.07x 
10~ 6 m 3 /mol and e/kg = 10.22 K. For argon, bo = 49.8 x 10 _6 m 3 /mol and 
e/k B = 119.8 K. (Based on Ref. 2.) 
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nonzero. The “statistics” of quantum particles give rise to corrections to the 
classical ideal gas equation of state. 

2.C.3. Van der Waals Equation of State [3] 

The van der Waals equation of state is of immense importance historically 
because it was the first equation of state which applies to both the gas and liquid 
phases and exhibits a phase transition. It contains most of the important 
qualitative features of the gas and liquid phases although it becomes less 
accurate as density increases. The van der Waals equation contains corrections 
to the ideal gas equation of state, which take into account the form of the 
interaction between real particles. The interaction potential between molecules 
in a gas contains a strong repulsive core and a weaker attractive region 
surrounding the repulsive core. For an ideal gas, as the pressure is increased, the 
volume of the system can decrease without limit. For a real gas this cannot 
happen because the repulsive core limits the close-packed density to some finite 
value. Therefore, as pressure is increased, the volume tends to some minimum 
value, V = V min — nb , where b is an experimental constant. The ideal gas 
equation of state must be corrected to take account of the existence of the 
repulsive core and assumes the form 

nRT 
V -nb' 

The attracive region of the potential causes the pressure to be decreased slightly 
relative to that of a noninteracting gas because it introduces a “cohesion” 
between molecules. The decrease in pressure will be proportional to the 
probability that two molecules interact; this, in turn, is proportional to the 
square of the density of particles ( N/V ) 2 . We therefore correct the pressure by a 
factor proportional to the square of the density, which we write a(n 2 /V 2 ). The 
constant a is an experimental constant which depends on the type of molecule 
being considered. The equation of state can now be written 

( P + ^Pj(V-nb) = nRT . (2.12) 


In Table 2. 1 we have given values of a and b for simple gases. 

The second virial coefficient for a van der Waals gas is easily found to be 

B { ™\T) = {b-^). (2.13) 

We see that B^^T) will be negative at low temperatures and will become 
positive at high temperatures but does not exhibit the maximum observed in real 
gases. Thus, the van der Waals equation does not predict all the observed 
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Table 2.1. Van der Waals Constants for Some 
Simple Gases [4] 



a (Pa ■ m 6 /mol 2 ) 

&(m 3 /mol) 

H 2 

0.02476 

0.02661 

He 

0.003456 

0.02370 

co 2 

0.3639 

0.04267 

h 2 o 

0.5535 

0.03049 

o 2 

0.1378 

0.03183 

n 2 

0.1408 

0.03913 


features of real gases. However, it describes enough of them to make it a 
worthwhile equation to study. In subsequent chapters, we will repeatedly use 
the van der Waals equation to study the thermodynamic properties of interacting 
fluids. 

2.C.4. Solids 

Solids have the property that their coefficient of thermal expansion, a P = 
(1 /v)(dv/dT) p , and their isothermal compressibility, k t = — (1 /v)(dv/dP) T , 
where v = V/n is the molar volume, are very small. Therefore, for solids at 
fairly low temperature we can expand the molar volume of a solid in a Taylor 
series about zero temperature and zero pressure value, vo, and obtain the 
following equation of state: 

v = vo(l + a P T - k t P), (2-14) 

where T is measured in Kelvins. Typical values [5] of k p are of the order of 
10~ ,0 /Pa or 10 _5 /atm. For example, for solid Ag (silver) at room temperature, 
kt = 1.3 x 10 -10 /Pa (for P = OPa), and for diamond at room temperature, 
«r = 1.6 x 10 -10 /Pa (for P = 4.0 x 10 8 Pa — 10 10 Pa). Typical values of a P 
are of the order 10~ 4 /K. For example, for solid Na (sodium) at room 
temperature we have a P = 2 x 10 _4 /K, and for solid K (potassium) we have 
a P = 2x 10“ 4 /K. 

2.C.5. Elastic Wire or Rod 

For a stretched wire or rod in the elastic limit, Hook’s law applies and we can 
write 


J = A(T)(L-Lo), (2.15) 

where J is the tension measured in Newtons per meter, A(T) is temperature 
dependent coefficient, L is the length of the stretched wire or rod, and L 0 is the 
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length of the wire when 7 = 0. The coefficient, A(T), may be written 
A(T) = Ao 4- Ai T, where A 0 and Ai are constants. The constant, A\, is negative 
for most substances but may be positive for some substances (including rubber). 


2.C.6. Surface Tension [6] 

Pure liquids in equilibrium with their vapor phase have a well-defined surface 
layer at the interface between the liquid and vapor phases. The mechanical 
properties of the surface layer can be described by thermodynamic state 
variables. The origin of the surface layer is the unequal distribution of 
intermolecular forces acting on the molecules at the surface. Molecules in the 
interior of the liquid are surrounded by, and interact with, molecules on all 
sides. Molecules at the surface interact primarily with molecules in the liquid, 
since the vapor phase (away from the critical point) is far less dense than the 
liquid. As a result, there is a strong tendency for the molecules at the surface to 
be pulled back into the liquid and for the surface of the liquid to contract. The 
molecular forces involved are huge. Because of this tendency for the surface to 
contract, work must be done to increase the free surface of the liquid. When the 
surface area is increased, molecules from the interior must be brought to the 
surface and therefore work must be done against interior molecular forces. The 
work per unit area needed to extend the surface area is called the surface 
tension of the liquid. For most pure liquids, the surface tension does not depend 
on the area and the equation of state has the form 

<T = <7 0 (l-^), (2.16) 

where t is the temperature in degrees Celius, <jq is the surface tension at 
t = 0°C, t' is experimentally determined temperature within a few degrees of 
the critical temperature, and n is an experimental constant which has a value 
between one and two. 

2.C.7. Electric Polarization [6-8] 

When an electric field E is applied to a dielectric material, the particles 
composing the dielectric will be distorted and an electric polarization field, P (P 
is the induced electric dipole moment per unit volume), will be set up by the 
material. The polarization is related to the electric field, E, and the electric 
displacement, D, by the equation 

D = e 0 E + P (2.17) 

where £q is the permittivity constant, £o = 8.854 x 10 -12 C 2 /N • m 2 . The 
electric field, E, has units of Newtons per coulomb (N/C), and the electric 
displacement and electric polarization have units of coulomb per square meter 
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(C/m 2 ). E results from both external and surface charges. The magnitude of the 
polarization field, P, will depend on the temperature. A typical equation of state 
for a homogeneous dielectric is 

p =(a + |) E ( 2 - 18 ) 

for temperatures not too low. Here a and b are experimental constants and T is 
temperature in degrees Kelvin. 

2.C.8. Curie’s Law [6-8] 

If we consider a paramagnetic solid at constant pressure, the volume changes 
very little as a function of temperature. We can then specify the state in terms of 
applied magnetic field and induced magnetization. When the external field is 
applied, the spins line up to produce a magnetization M (magnetic moment per 
unit volume). The magnetic induction field, B (measured in units of teslas, 1 
T=1 weber/m 2 ), the magnetic field strength, H (measured in units of amphere/ 
meter), and the magnetization are related through the equation 

B-moH + moM, (2.19) 

where /i 0 is the permeability constant (fio = 47 rx 10“ 7 T • m/A). The equation 
of state for such a system at room temperature is well approximated by Curie’s 
law: 


M = ^H, (2.20) 

where n is the number of moles, D is an experimental constant dependent on the 
type of material used, and the temperature, T, is measured in Kelvins. 


2.D. THE LAWS OF THERMODYNAMICS [6] 

Thermodynamics is based upon four laws. Before we can discuss these laws in a 
meaningful way, it is helpful to introduce some basic concepts. 

A system is in thermodynamic equilibrium if the mechanical variables do not 
change in time and if there are no macroscopic flow processes present. Two 
systems are separated by a fixed insulating wall (a wall that prevents transfer of 
matter heat and mechanical work between the systems) if the thermodynamic 
state variables of one can be changed arbitrarily without causing changes in the 
thermodynamic state variables of the other. Two systems are separated by a 
conducting wall if arbitrary changes in the state variables of one cause changes 
in the state variables of the other. A conducting wall allows transfer of heat. An 
insulating wall prevents transfer of heat. 
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It is useful to distinguish among three types of thermodynamic systems. An 
isolated system is one which is surrounded by an insulating wall, so that no heat 
or matter can be exchanged with the surrounding medium. A closed system is 
one which is surrounded by a conducting wall so that heat can be exchanged but 
matter cannot. An open system is one which allows both heat and matter 
exchange with the surrounding medium. 

It is possible to change from one equilibrium state to another. Such changes 
can occur reversibly or irreversibly. A reversible change is one for which the 
system remains infinitesimally close to the thermodynamic equilibrium — that 
is, is performed quasi-statically. Such changes can always be reversed and the 
system brought back to its original thermodynamic state without causing any 
changes in the thermodynamic state of the universe. For each step of a 
reversible process, the state variables have a well-defined meaning. 

An irreversible or spontaneous change from one equilibrium state to another 
is one in which the system does not stay infinitesimally close to equilibrium 
during each step. Such changes often occur rapidly and give rise to flows and 
“friction” effects. After an irreversible change the system cannot be brought 
back to its original thermodynamic state without causing a change in the 
thermodynamic state of the universe. With these ideas in mind, we an now 
discuss the four laws of thermodynamics. 

2.D.I. Zeroth Law: Two Bodies, Each in Thermodynamic 
Equilibrium with a Third System, are in Thermodynamic 
Equilibrium with Each Other 

The zeroth law is of fundamental importance to experimental thermodynamics 
because it enables us to introduce the concept of a thermometer and to measure 
temperatures of various systems in a reproducible manner. If we place a 
thermometer in contact with a given reference system, such as water at the 
triple point (where ice, water, and vapor coexist), then the mechanical variables 
describing the thermodynamic state of the thermometer (e.g., the height of a 
mercury column, the resistance of a resistor, or the pressure of a fixed volume 
container of gas) always take on the same values. If we then place the 
thermometer in contact with a third system and the mechanical variables do not 
change, then we say that the third system, the thermometer, and water at the 
triple point all have the same “temperature.” Changes in the mechanical 
variables of the thermometer as it is cooled or heated are used as a measure of 
temperature change. 

2.D.2. First Law: Energy Is Conserved 

The first law tells us that there is a store of energy in the system, called the 
internal energy, U, which can be changed by causing the system to do work, 
<jiW, or by adding heat, </lQ, to the system. We use the notation, flW, to indicate 
that the differential is not exact.) The change in the internal energy which 
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results from these two processes is given by 


dU = 4Q-4W. ( 2 . 21 ) 

The work, may be due to changes in any relevant extensive “mechanical” 
or chemical variable. In general it can be written 

fiW = PdV - JdL - odA - E • 7P — H • dM - 4>de - ^ pjdNj, (2.22) 

j 

where dU,dV,dL,dA,dP,dM,de, and dNj are exact differentials, but 
and flW are not because they depend on the path taken (on the way in 
which heat is added or work is done). The meaning of the first five terms 
in Eq. (2.22) was discussed in Section (2.C). The term, —cj>de, is the work 
the system needs to do it is has an electric potential, 4>, and increases its 
charge by an amount, de. The last term, -p'jdNj, is the chemical work 
required for the system to add dNj neutral particles if it has chemical 
potential, pj. We may think of - P, 7, <7, E, H, (f>, and Pj as generalized forces, 
and we may think of dV,dL, dA, 7P, dM , de and dNj as generalized 
displacements. 

It is useful to introduce a generalized mechanical force, Y, which denotes 
quantities such as, -P, 7, cr, E, H, and (j> , and a generalized displacement, X, 
which denotes the corresponding displacements, V,L,A,P,M, and e, 
respectively. Then the first law can be written in the form 

dU = 4Q+ YdX + Pj dN r ( 2 - 23 ) 

j 

Note that /xj is a chemical force and dNj is a chemical displacement. Note also 
that the pressure, P, has a different sign from the other generalized forces. If we 
increase the pressure, the volume increases, whereas if we increase the force, Y, 
for all other cases, the extensive variable, X, decreases. 

2.D.3. Second Law: Heat Flows Spontaneously from High 
Temperatures to Low Temperatures 

There are a number of ways to state the second law, with the one given above 
being the simplest. Three alternative versions are [6]: 

(a) The spontaneous tendency of a system to go toward thermodynamic 
equilibrium cannot be reversed without at the same time changing some 
organized energy, work, into disorganized energy, heat. 

(b) In a cyclic process, it is not possible to convert heat from a hot reservoir 
into work without at the same time transferring some heat to a colder 


reservoir. 
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(c) The entropy change of any system and its surroundings, considered 
together, is positive and approaches zero for any process which 
approaches reversibility. 

The second law is of immense importance from many points of view. From it 
we can compute the maximum possible efficiency of an engine which 
transforms heat into work. It also enables us to introduce a new state variable, 
the entropy, S, which is conjugate to the temperature. The entropy gives us a 
measure of the degree of disorder in a sysem and also gives us a means for 
determining the stability of equilibrium states, and, in general, it forms an 
important link between reversible and irreversible processes. 

The second law is most easily discussed in terms of an ideal heat engine first 
introduced by Carnot. The construction of all heat engines is based on the 
observation that if heat is allowed to flow from a high temperature to a lower 
temperature, part of the heat can be turned into work. Carnot observed that 
temperature differences can disappear spontaneously without producing work. 
Therefore, he proposed a very simple heat engine consisting only of reversible 
steps, thereby eliminating wasteful heat flows. The Carnot engine consists of 
the four steps shown in Fig. 2.2. These include: 

(a) Isothermal (constant temperature) absorption of heat AQn from a 
reservoir at a high temperature T h (we use A to indicate a finite rather 
than an infinitesimal amount of heat) (the process 1 — ► 2). 

(b) Adiabatic (constant heat content) change in temperature from r/, to the 
lower value r c (the process 2 — > 3). 



Fig. 2.2. A Carnot engine which runs on a substance with state variables, X and Y. The 
processes 1 — > 2 and 3 — > 4 occur isothermally at temperatures r h and r c , respectively. 
The processes 2 — ► 3 and 4 — > 1 occur adiabatically. The heat absorbed is AQ i2 and the 
heat ejected is AQ 43 . The shaded area is equal to the work done during the cycle. The 
whole process takes place reversibly. 
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(c) Isothermal expulsion of heat AQ 43 into a reservoir at temperature r c (the 
process 3 — > 4). 

(d) Adiabatic return to the initial state at temperature 77 , (the process 4 — ► 1). 

The work done by the engine during one complete cycle can be found by 
integrating the differential element of work Y dX about the entire cycle. We see 
that the total work A W tot done by the engine is given by the shaded area in Fig. 
2 . 2 . 

The total efficiency rj of the heat engine is given by the ratio of the work 
done to heat absorbed: 


_ A Wt ot 
V A Q n 


(2.24) 


Since the internal energy U is state variable and independent of path, the total 
change AU tot for one complete cycle must be zero. The first law then enables us 
to write 


A V M = Agio. - AW„ = 0 (2.25) 

and thus 

A W to t = A&ot = AQ n + AQ 34 = AQ n - AQ 43 . (2.26) 

If we combine Eqs. (2.24) and (2.26), we can write the efficiency in the form 

AQ43 


77= 1 - 


AQn 


(2.27) 


A 100% efficient engine is one which converts all the heat it absorbs into work. 
However, as we shall see, no such engine can exist in nature. 

The great beauty and utility of the Carnot engine lies in the fact that it is the 
most efficient of all heat engines operating between two heat reservoirs, each at 
a (different) fixed temperature. This is a consequence of the second law. To 
prove this let us consider two heat engines, A and B (cf. Fig. 2.3), which run 
between the same two reservoirs r/, and r c . Let us assume that engine A is a heat 
engine with irreversible elements and B is a reversible Carnot engine. We will 
adjust the mechanical variables X and Y so that during one cycle both engines 
perform the same amount of work (note that X A and Y A need not be the same 
mechanical variables as X B and y B ): 

A W* = A W* = AH'. (2.28) 

Let us now assume that engine A is more efficient than engine B: 


Va > V B 


(2.29) 
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Fig. 2.3. Two heat engines, A and B, work together. Engine B acts as a heat pump while 
engine A acts as a heat engine with irreversible elements. Engine A cannot have a greater 
efficiency than engine B without violating the second law. 


and thus 


or 


AW AW 

AQ 12 A<2®2 

A Q 12 > AQ^ 2 - 


(2.30) 


(2.31) 


We can use the work produced by engine A to drive the Carnot engine as a 
refrigerator. Since the Carnot engine is reversible, it will have the same 
efficiency whether it runs as a heat engine or as a heat pump. The work, AW, 
produced by A will be used to enable the Carnot engine B to pump heat from 
the low-temperature reservoir to the high-temperature reservoir. The net heat 
extracted from reservoir r c and delivered to reservoir 77 , is 

- AW - (A ef 2 - AW) = Ag? 2 - A e*. (2.32) 

If engine A is more efficient than engine B, then the combined system has 
caused heat to flow from low temperature to high temperature without any work 
being expended by an outside source. This violates the second law and therefore 
engine A cannot be more efficient than the Carnot engine. If we now assume 
that both engines are Carnot engines, we can show, by similar arguments, that 
they both must have the same efficiency. Thus, we reach the following 
conclusion: No engine can be more efficient than a Carnot engine, and all 
Carnot engines have the same efficiency. 

From the above discussion, we see that the efficiency of a Carnot engine is 
completely independent of the choice of mechanical variables X and Y and 
therefore can only depend on the temperatures 17 , and r c of the two reservoirs. 
This enables us to define an absolute temperature scale. From Eq. (2.27) we see 
that 


A j243 

AC?12 


=f(n,T C ), 


(2.33) 
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Fig. 2.4. Two Carnot engines running between three reservoirs with temperatures 
Th > t > t c have the same overall efficiency as one Carnot engine running between 
reservoirs with temperatures r/, > tc . 


where /(r/,, r c ) is some function of temperatures r/, and r c . The function 
f(Th,T c ) has a very special form. Let us consider two heat engines running 
between three reservoirs r h > r' > t c (cf. Fig. 2.4). We can write 


A043 

A012 


= f(Tk,T l ), 


(2.34) 


and 


so that 


A 065 , \ 

Ae 43 =/(T ’ Tc) ’ 

(2.35) 

A 065 j, ( \ 
AQ l2 ~ f ^ 

(2.36) 

£ 

% 

K 

"V 

£ 

II 

£ 

(2.37) 


Thus, /(t/,,t c ) = g(T/,)g _1 (r c ) where g(r) is some function of temperature. 

One of the first temperature scales proposed but not widely used is due to 
W. Thomson (Lord Kelvin) and is called the Thomson scale [9]. It has the form 


A £>43 _ g Tc ° 
AQn e T h 


(2.38) 


The Thomson scale is defined so that a given unit of heat A £>12 flowing between 
temperatures r° -4 (r° — 1) always produces the same amount of work, 
regardless of the value of r°. 
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A more practical scale, the Kelvin scale, was also introduced by Thomson. It 
is defined as 


A £>43 _ Tc_ 
A- 012 Th ’ 


(2.39) 


As we will see below, the Kelvin scale is identical to the temperature used in the 
ideal gas equation of state and is the temperature measured by a gas 
thermometer. For this reason, the Kelvin scale is the internationally accepted 
temperature scale at the present time. 

The units of degrees Kelvin are the same as degrees Celsius. The ice point of 
water at atmospheric pressure is defined as 0 °C, and the boiling point is defined 
as 100 °C. The triple point of water is 0.01 °C. To obtain a relation between 
degrees Kelvin and degrees Celsius, we can measure pressure of a real dilute 
gas as a function of temperature at fixed volume. It is found experimentally 
that the pressure varies linearly with temperature and goes to zero at 
t c = — 273.1 5 °C. Thus, from the ideal gas law, we see that degrees Kelvin, 
T, are related to degrees Celsius, t c , by the equation 

T = (t c + 273.15). (2.40) 


The triple point of water is fixed at T = 273.16 K. 

In Exercise 2.2, we compute the efficiency of a Carnot engine which uses an 
ideal gas as an operating substance. However, Carnot engines can be 
constructed using any of a variety of substances (some examples are left as 
problems). Regardless of the operating substance, all Carnot engines have the 
same efficiency. 


■ Exercise 2.2. Compute the efficiency of a Carnot cycle (shown in the 
figure below) which uses a monatomic ideal gas as an operating substance. 



Answer: The equation of state of an ideal gas is PV = nRT, where P = —Y 
is the pressure and V = X is the volume, T is the temperature in Kelvins, and 
n is the number of moles. The internal energy is U — (3/2 )nRT. The Carnot j 
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cycle for an ideal gas is shown in the figure below. The part 1 — » 2 is an j 

isothermal expansion of the system, and the path 3 — » 4 is an isothermal I 

contraction. It is clear from the equation of state that the temperature, 7/,, of 
path 1 — > 2 is higher than the temperature, T c , of path 3 — * 4. The path 
2 — * 3 is an adiabatic expansion of the system, and the path 4 — > 1 is an 
adiabatic contraction. We shall assume that n is constant during each cycle. 

Let us first consider the isothermal paths. Since the temperature is 
constant along these paths, dU = \nRdT = 0. Thus, along the path i 

1 -» 2 ,0Q = 0W = nRT h (dV/V). The heat absorbed along the path 

1 2 is 

{ Vl dV fVo\ 

AQ n =nRT h — = nRT h In . (1) 

The heat absorbed along the path 3 — > 4 is 

AQ M = nRT c ]n(^j. (2) 

Since V 2 > Vj , AQu > 0 and heat is absorbed along the path 1 — > 2. Since 
V 3 > V 4 , AQ 34 < 0 and heat is ejected along the path 3 -> 4. 

Let us next consider the adiabatic paths. Along the adiabatic path, 
<flQ — 0 = dU + PdV = (3/2 )nRdT + PdV. If we make use of the equation 
of state, we find (3/2 ){dT/T) = -(dV/V). We now integrate to find 
r 3 / 2 V = constant for an adiabatic process. Thus, along the paths 2 — ► 3 and 
4 — ► 1, respectively, we have 

T c V ^ /3 = T h V 2 /3 and T C V 2 J 3 = T h v\ /3 . (3) 

For the entire cycle, we can write A£/ to t = A Q tot — AW tot = 0. Thus 

j AW m = A & ot = AQ , 2 + AQ 34 . The efficiency of the Carnot cycle is | 

[ 

aw m a Qm . r c in(v 3 /v 4 ) r c 

V A Qn A e, 2 r,ln(V 2 /Vi) T h ’ U 

j since from Eq. (3) we have 

! v 1 = v 1 

[ v 4 vr 


We can use the Carnot engine to define a new state variable called the entropy. 
All Carnot engines have an efficiency 


*7=1- 


A(?43 

AQ\2 



(2.41) 
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Fig. 2.5. An arbitrary reversible heat en- 
gine is composed of many infinitesimal 
Carnot engines. The area enclosed by the 
curve is equal to the work done by the heat 
engine. 


(cf. Exercise 2.2) regardless of operating substance. Using Eq. (2.41), we can 
write the following relation for a Carnot cycle: 


AQn A Q 34 
T h T c 


(2.42) 


(note the change in indices in AQ 24 ). Equation (2.42) can be generalized to the 
case of an arbitrary reversible heat engine because we can consider such an 
engine as being composed of a sum of many infinitesimal Carnot cycles (cf. 
Fig. 2.5). Thus, for an arbitrary reversible heat engine we have 


The quantity 



(2.43) 


(2.44) 


is an exact differential and the quantity 5, called the entropy, may be considered 
a new state variable since the integral of dS about a closed path gives zero. 

No heat engine can be more efficient than a Carnot engine. Thus, an engine 
which runs between the same two reservoirs but contains spontaneous or 
irreversible processes in some part of the cycle will have a lower efficiency, and 
we can write 


A <243 ^ T c 

A& 2 T h 

(2.45) 

AQ i2 AQti , „ 

T h T c 

(2.46) 


For an arbitrary heat engine which contains an irreversible part, Eq. (2.46) gives 
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the very important relation 

< 0. (2.47) 

For an irreversible process, flQ/T can no longer be considered an exact 
differential. 

A system may evolve between two thermodynamic states either by a 
reversible path or by a spontaneous, irreversible path. For any process, 
reversible or irreversible, the entropy change depends only on the initial and 
final thermodynamic states of the system, since the entropy is a state variable. If 
the system evolves between the initial and final states via a reversible path, then 
we can compute the entropy change along that path. However, if the system 
evolves between the initial and final states via an irreversible path, then we must 
construct a hypothetical reversible path between the initial and final states in 
order to use the equations of thermodynamics to compute the entropy change 
during the spontaneous process. For the irreversible path, the heat absorbed by 
the system will be less than that along the reversible path [cf. Eqs. (2.45)- 
(2.47)]. Therefore, J irrev flQ/T < J rev fiQ/T. This means that for an irreversible 
process , [ mev flQ/T does not contain all contributions to the entropy change. 
Some of it comes from the disorder created by spontaneity. This result is usually 
written in the form 

dS = ^ + d,S, (2.48) 

where d t S denotes the entropy production due to spontaneous processes. For a 
reversible process, d t S = 0 and dS = (1 /T)flQ so the entropy change is entirely 
due to a flow of heat into or out of the system. For a spontaneous (irreversible)) 
process, d t S > 0. 

For an isolated system we have flQ = 0, and we obtain the important relation 

dS = diS > 0, (2.49) 

where the equality holds for a reversible process and the inequality holds for a 
spontaneous or irreversible process. Since the equilibrium state is, by definition, 
a state which is stable against spontaneous changes, Eq. (2.49) tells us that the 
equilibrium state is a state of maximum entropy. As we shall see, this fact gives 
an important criterion for determining the stability of the equilibrium state for 
an isolated system. 

2.D.4. Third Law: The Difference in Entropy Between States 
Connected by a Reversible Process Goes to Zero in the 
Limit T -> 0 K [9-11] 

The third law was first proposed by Nemst in 1906 on the basis of experimental 
observations and is a consequence of quantum mechanics. Roughly speaking, a 
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Fig. 2.6. The fact that curves 7 = 0 and Y = Y\ must approach the same point (the third 
law) makes it impossible to reach absolute zero by a finite number of reversible steps. 


system at zero temperature drops into its lowest quantum state and in this sense 
becomes completely ordered. If entropy can be thought of as a measure of 
disorder, then at T = 0 K it must take its lowest value. 

An alternative statement of the third law, and a direct consequence of the 
above statement, is, It is impossible to reach absolute zero in a finite number of 
steps if a reversible process is used. This alternative statement is easily 
demonstrated by means of a plot in the S-T plane. In Fig. 2.6 we have plotted 
the curves as a function of S and T for two states Y = 0 and 7 = 7j for an 
arbitrary system. (A specific example might be a paramagnetic salt with 
Y = H.) We can cool the system by alternating between the two states, 
adiabatically and isothermally. From Eqs. (2.5) and (2.6), we write 



As we shall show in Section 2.H, thermal stability requires that ( dS/dT) Y > 0. 
Equation (2.50) tells us that if T decreases as Y increases isentropically, then S 
must decrease as Y decreases isothermally, as shown in Fig. 2.6. For the process 
1 — > 2 we change from state Y = Y\ to state 7 = 0 isothermally, thus squeezing 
out heat, and the entropy decreases. For process 2 — > 3, we increase 7 
adiabatically from 7 = 0 to 7 = Y\ and thus decrease the temperature. We can 
repeat these processes as many times as we wish. However, as we approach 
T = 0 K, we know by the third law that the two curves must approach the same 
point and must therefore begin to approach each other, thus making it 
impossible to reach T = 0 K in a finite number of steps. 

Another consequence of the third law is that certain derivatives of the 
entropy must approach zero as T — * 0 K. Let us consider a process at T = 0 K 
such that 7 — > 7 + dY and X — ♦ X + dX. Then the change in entropy if Y, T, and 
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N are chosen as independent variables is (assume dN = 0) 


dS = 



dY , 

N,T = 0 


(2.51) 


or if X , I ’ and A are chosen as independent we obtain 





</X. 


V,T=0 


(2.52) 


Thus, if the states (F, 7 = OK) and (Y + dY,T = OK) or the states 
(X, T — 0 K) and (X + dX , T = 0 K) are connected by a reversible process, 
we must have dS = 0 (third law) and therefore 


and 


'ds\ 

dYj ~ ^ 

(2.53) 

dS\ 

dx) =0 

N,T = 0 

(2.54) 


Equations (2.53) and (2.54) appear to be satisfied by real substances. 


2.E. FUNDAMENTAL EQUATION OF 
THERMODYNAMICS [10] 

The entropy plays a central role in both equilibrium and nonequilibrium 
thermodynamics. It can be thought of as a measure of the disorder in a system. 
As we shall see in Chapter 7, entropy is obtained microscopically by state 
counting. The entropy of an isolated system is proportional to the logarithm of 
the number of states available to the system. Thus, for example, a quantum 
system in a definite quantum state (pure state) has zero entropy. However, if the 
same system has finite probability of being in any of a number of quantum 
states, its entropy will be nonzero and may be quite large. 

The entropy is an extensive, additive quantity. If a system is composed of a 
number of independent subsystems, then the entropy of the whole system will 
be the sum of the entropies of the subsystems. This additive property of the 
entropy is expressed mathematically by the relation 


S(XU, AX, {AA,}) = XS(U,X,{Ni}). (2.55) 

That is, the entropy is a first-order homogeneous function of the extensive state 
variables of the system. If we increase all the extensive state variables by a 
factor A, then the entropy must also increase by a factor A. 
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Differential changes in the entropy are related to differential changes in the 
extensive state variables through the combined first and second laws of 
thermodynamics: 


TdS >4Q = dU-YdX-^2 ^j dN r (2.56) 

j 


The equality holds if changes in the thermodynamic state are reversible. The 
inequality holds if they are spontaneous or irreversible. Equations (2.55) and 
(2.56) now enable us to define the Fundamental Equation of thermodynamics. 
Let us take the derivative of AS with respect to A: 



However, from Eq. (2.56) we see that 


\w)x,m 



m = 

\oxJvm 


Y 

T’ 


(2.57) 


(2.58) 

(2.59) 


and 


/an = _4 

W w ii,r t 


(2.60) 


Equations (2.58)-(2.60) are called the thermal, mechanical, and chemical 
equations of state , respectively. The mechanical equation of state, Eq. (2.59), is 
the one most commonly seen and is the one which is described in Section 2.C. 
If we now combine Eqs (2.57)-(2.60), we obtain 


TS = U - XY - ^2 Vj N j- (2.61) 

j 

Equation (2.61) is called the Fundamental Equation of thermodynamics (it is 
also known as Euler’s equation) because it contains all possible thermodynamic 
information about the thermodynamic system. If we take the differential of Eq. 
(2.61) and subtract Eq. (2.56) (we will take the reversible case), we obtain 
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another important equation, 


SdT + XdY + Njdfi'j = 0, (2.62) 

j 

which is called the Gibbs-Duhem equation. The Gibbs-Duhem equation relates 
differentials of intensive state variables. 

For a monatomic system, the above equations simplify somewhat if we 
work with densities. As a change of pace, let us work with molar densities. For 
single component system the Fundamental Equation can be written TS = U— 
YX — fin and the combined first and second laws (for reversible processes) can 
be written TdS = dU — YdX — fidn. Let us now introduce the molar entropy, 
s = S/n, the molar density, x — Xjn , and the molar internal energy, u = U/n. 
Then the Fundamental Equation becomes 

Ts = u — Yx — fi, (2.63) 

and the combined first and second laws become (for reversible processes) 

Tds = du— Ydx. (2.64) 

Therefore, ( ds/du) x = \/T and (ds/dx) u = -Y/T. The Gibbs-Duhem equa- 
tion is simply 


dfjL = — sdT — xdY , (2.65) 

and therefore the chemical potential has the form, f. i = n(T,Y), and is a 
function only of the intensive variables, T and Y Note also that s = —(dfi/dT) Y 
and x = —(dfi/dY) T . In Exercise 2.3, we use these equations to write the 
Fundamental Equation for an ideal monatomic gas. 


! ■ EXERCISE 2.3. The entropy of n moles of a monatomic ideal gas is 
J S = (5/2)nR + nR\n[(V/Vo)(no/n)(T/To) 3 / 2 ], where Vb,«o, anc * To are 
constants (this is called the Sackur-Tetrode equation ). The mechanical 
| equation of state is PV — nRT. (a) Compute the internal energy, (b) 

| Compute the chemical potential, (c) Write the Fundamental Equation for an 
ideal monatomic ideal gas and show that it is a first-order homogeneous 
j function of the extensive state variables. 

j Answer: It is easiest to work in terms of densities. The molar entropy can be [ 
written s = (5/2)/? + Rln[(v/vo)(T/7o) 3 ^ 2 ](v = V/n is the molar volume), j 
| and the mechanical equation of state is Pv = RT. j 
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(a) The combined first and second law gives du = Tds - Pdv. If we 
further note that ds = ( ds/dT) v dT + (ds/dv) T dv, then 


du = T 



dT + 



(i) 


since (ds/dT) v = (3R/2T) and {ds/dv) T = R/v. Therefore, the 
molar internal energy is u = \RT + «o where uo is a constant, and 
the total internal energy is U = nu = \ nRT + Uo, where Uo = nu Q . 

(b) Let us rewrite the molar entropy in terms of pressure instead of molar 
volume. From the mechanical equation of state, v = (RT /P) and 
v 0 = (RTq/Po). Therefore, 5 = §P + R\n[(Po/P){T/T 0 ) 5 ' 2 ]. From the 
Gibbs-Duhem equation, (dfi/dT) p = — s = — (|P+ Pln[(Po/P) 
(T/To) 5 / 2 }) and (dn/dP) T = v = RT/P. If we integrate these we 
obtain the following expression for the molar chemical potential: 


fi = —RT In 


[Vo /r\ 5/2 ^ 
p VV 


(2) 


(c) Let us rewrite the entropy in terms of the internal energy, volume, and 
number of moles. We obtain 


S = 


- nR + nR In 
2 


Lm 5/2 (£\ V2 ' 

Vo U; 


(3) 


Equation (3) is the Fundamental Equation for an ideal monatomic gas. 
It clearly is a first-order homogeneous function of the extensive 
variables. 

It is interesting to note that this classical ideal gas does not obey the 
third law of thermodynamics and cannot be used to describe systems 
at very low temperatures. At very low temperatures we must include 
quantum corrections to the ideal gas equation of state. 


2.F. THERMODYNAMIC POTENTIALS [11] 

In conservative mechanical systems, such as a spring or a mass raised in a 
gravitational field, work can be stored in the form of potential energy and 
subsequently retrieved. Under certain circumstances the same is true for 
thermodynamic systems. We can store energy in a thermodynamic system by 
doing work on it through a reversible process, and we can eventually retrieve 
that energy in the form of work. The energy which is stored and retrievable in 
the form of work is called the free energy. There are as many different forms of 
free energy in a thermodynamic system as there are combinations of 
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constraints. In this section, we shall discuss the five most common ones: 
internal energy, U; the enthalpy, H\ the Helmholtz free energy, A ; the Gibbs free 
energy, G; and the grand potential, fl. These quantities play a role analogous to 
that of the potential energy in a spring, and for that reason they are also called 
the thermodynamic potentials. 


2.F.I. Internal Energy 

From Eq. (2.61) the fundamental equation for the internal energy can be 
written 


U = ST + YX + p'jNj, (2.66) 

j 

where T, Y, and p!j are considered to be functions of S, X, and {Nj} [cf. Eqs. 
(2.58)-(2.60)]. From Eq. (2.56), the total differential of the internal energy can 
be written 


dU < TdS + YdX + ^2 ^j dN J- 
j 

(2.67) 

The equality holds for reversible changes, and the inequality holds for changes 
which are spontaneous. From Eq. (2.67) we see that 

II 

'213 

* 

3 

(2.68) 


(2.69) 


and 



(dU\ 

\ dN jJ S,X,{N ¥ j} 


(2.70) 


We can use the fact that dU is an exact differential to find relations between 
derivatives of the intensive variables, T, Y, and p'j. From Eq. (2.4) we know, for 
example, that 


\± 

IdX 





S,{Nj } 


d_ 

dS 

L 







(2.71) 
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From Eqs. (2.68), (2.69), and (2.71), we obtain 



m . 

\dSJx,{Nj} 


(2.72) 


(i + 1) additional relations like Eq. (2.72) exist and lead to the identities 


and 


dT\ 

dNj) 


sMth#) 


8Y\ 

dN jJ sj,{n ¥ j } 


) x,{Nj} 

M\ 

s,{Nj} 


f d Vj\ = fM\ 

\pNiJ sj,{N ¥i } \ dN jJ SJ,{N ¥ j} 


(2.73) 

(2.74) 


(2.75) 


Equations (2.72)-(2.75) are extremely important both theoretically and 
experimentally because they provide a relation between rates of change of 
seemingly diverse quantities. They are called Maxwell relations. 

For a substance with a single type of particle, the above equations simplify if 
we work with densities. Let u = U/n denote the molar internal energy. Then the 
Fundamental Equation can be written u = Ts + Yx + \i, where s is the molar 
entropy and x is a molar density. The combined first and second laws (for 
reversible processes) are du = Tds + Ydx. Therefore we obtain the identities 
(< du/ds) x = T and ( du/dx) s = Y. Maxwell relations reduce to ( dT/dx) s = 
\dY/ds) x . 

The internal energy is a thermodynamic potential or free energy because for 
processes carried out reversibly in an isolated, closed system at fixed X and 
{Nj}, the change in internal energy is equal to the maximum amount of work 
that can be done on or by the system. As a specific example, let us consider a 
PVT system (cf. Fig. 2.7). We shall enclose a gas in an insulated box with fixed 
total volume and divide it into two parts by a movable conducting wall. We can 
do work on the gas or have the gas do work by attaching a mass in a 
gravitational field to the partition via a pulley and insulated string. To do work 
reversibly, we assume that the mass is composed of infinitesimal pieces which 
can be added or removed one by one. If P\A + mg> P 2 A, then work is done on 
the gas by the mass, and if P\A + mg < P 2 A, the gas does work on the mass. 
The first law can be written 


AU = AQ- AW, 


(2.76) 
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Fig. 2.7. For a reversible process in a closed, 
insulated box of fixed size ( AS = 0, AV = 0, 
ANj = 0), the work done in lifting the weight 
will be equal to the change in the internal 
energy, (AU) SVN = -A W fIte . 

where AU is the change in total internal energy of the gas, A Q is the heat flow 
through the walls, and AW can be divided into work done due to change in size 
of the box, J PdV, and work done by the gas in raising the weight, AW^e- 

AW = J/W + AW free . (2.77) 

For a reversible process, A Q — J TdS. For the reversible process pictured in 
Fig. 2.7, A Q = 0, AV = 0, and ANj = 0 (if no chemical reactions take place). 
Therefore, 

(Af/) s v ,{^.} = — AWfree- (2.78) 

Thus, for a reversible process at constant S, V, and Nj, work can be stored in the 
form of internal energy and can be recovered completely. Under these 
conditions, internal energy behaves like a potential energy. 

For a spontaneous process, work can only be done at constant S, V, and {N,} 
if we allow heat to leak through the walls. The first and second laws for a 
spontaneous process take the form 

TdS-^PdV- AW free + J2 jp.jdNj, (2.79) 

where the integrals are taken over a reversible path between initial and final 
states and not the actual spontaneous path. We can do work on the gas 
spontaneously by allowing the mass to drop very fast. Then part of the work 
goes into stirring up the gas. In order for the process to occur at constant 
entropy, some heat must leak out since A Q < J TdS = 0. Thus, for a 
spontaneous process 

(Af/) 5 v,{A^} < — ^ AWfree- (2.80) 

Not all work is changed to internal energy and is retrievable. Some is wasted in 
stirring the gas. (Note that for this process the entropy of the universe has 
increased since heat has been added to the surrounding medium.) 
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For processes involving mechanical variables Y and X we can write Eqs. 
(2.78) and (2.80) in the form 


(A^W,} < (-AWfree), (2.81) 

where AWf ree is any work done by the system other than that required to change 
X. For a reversible process at constant S, X, and { Nj }, work can be stored as 
internal energy and can be recovered completely. 

If a process takes place in which no work is done on or by the system, then 
Eq. (2.81) becomes 

(± v )s*m ^ 0 < 2 - 82 ) 

and the internal energy either does not change (reversible process) or decreases 
(spontaneous process). Since a system in equilibrium cannot change its state 
spontaneously, we see that an equilibrium state at fixed S, X, and {/V, } is a state 
of minimum internal energy. 

2.F.2. Enthalpy 

The internal energy is the convenient potential to use for processes carried out 
at constant X, S, and {Nj}. However, it often happens that we wish to study the 
thermodynamics of processes which occur at constant S, Y, and {Nj}. Then it is 
more convenient to use the enthalpy. 

The enthalpy, H, is useful for systems which are thermally isolated and 
closed but mechanically coupled to the outside world. It is obtained by adding 
to the internal energy an additional energy due to the mechanical coupling: 

h = u-xy = st + J2 Vjty- (2.83) 

j 

The addition of the term —XY has the effect of changing the independent 
variables from (S,X,Nj) to (5, Y, Nj ) and is called a Legendre transformation. If 
we take the differential of Eq. (2.83) and combine it with Eq. (2.67), we obtain 

dH <TdS-XdY + '^T h dN J (2.84) 


and, therefore, 


T = 



(2.85) 


X = 


dH' 

dY 


s,m 


( 2 . 86 ) 
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and 



dH\ 

P N j) S,Y,{N ¥j } 


(2.87) 


Since dH is an exact differential, we can use Eq. (2.4) to obtain a new set of 
Maxwell relations: 




( 2 . 88 ) 


(dT\ _ ( d ^j\ 

\d N jJ s,y,{n ¥ j} \® s / y,{Nj} 


(2.89) 


/dX_\ 

\ dN jJ S,Y,{N ¥j ) 



(2.90) 


and 


= (M 

s.r.iNu,} \ dN J 

which relate seemingly diverse partial derivatives. 

For a substance with a single type of molecule, Eqs. (2.84)-(2.91) become 
particularly simple if we work with densities. Let h = H/n denote the molar 
enthalpy. Then the fundamental equation for the molar enthalpy can be written 
h = u — xY = sT + fi. The exact differential of the molar enthalpy is dh = 
Tds — xdY (for reversible processes), which yields the identities ( dh/ds) Y = T 
and ( dh/dY) s = x. Maxwell’s relations reduce to ( dT/dY) s = — (dx/ds) Y . In 
Exercise 2.4, we compute the enthalpy for a monatomic ideal gas in terms of its 
natural variables. 


S,Y,{N ¥j } 


(2.91) 



} ■ EXERCISE 2.4. Compute the enthalpy for n moles of a monatomic 
ideal gas and express it in terms of its natural variables. The mechanical 
equation of state is PV = nRT and the entropy is S = \ nR + n/?ln[(V'/Vo) 

(« 0 /«)(r/r 0 ) 3/2 ]. 

Answer: Let us write the molar entropy in terms of temperature and 
pressure. It is s = |/? + R\n[(Po/P)(T /Toy/ 2 ]. Also note that when P = Po 
and T = Tq, s = sq = ^R. Now since dh = Tds + vdP we have 


I 



T= To 



2/5 

e (s-so)/so 


(1) 
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and 



( 2 ) 


If we integrate, we find h = ^RTo(P/Po) 2 ^ 5 e^ s ~ s °^ s ° = \RT. In terms of 
temperature, the enthalpy is h = \RT. There is an easier way to obtain these 
results. From Exercise 2.3, the molar internal energy is u = \RT. The 
fundamental equation for the molar enthalpy is h — u + vP, where v = V/n 
is the molar volume. Since v = RT/P, we obtain h = | RT and H = \nRT. 


For a YXT system, the enthalpy is a thermodynamic potential for reversible 
processes carried out at constant Y. The discussion for the enthalpy is 
completely analogous to that for the internal energy except that now we allow 
the extensive variable X to change and maintain the system at constant Y. We 
then find 


A H< 



XdY-AW free + 



(2.92) 


where the equality holds for a reversible process and the inequality holds for a 
spontaneous process (AW free is defined in Section 2.F.1). Therefore, 


(A H) svm < (-AWW) (2.93) 

and we conclude that, for a reversible process at constant S, Y, and {Nj}, work 
can be stored as enthalpy and can be recovered completely. 

If a process takes place at constant S, Y, and {A^ } in which no work is done 
on or by the system, then 


(A H) S Yt {Nj} < o. (2.94) 

Since the equilibrium state cannot change spontaneously, we find that the 
equilibrium state at fixed S, Y, and {Nj} is a state of minimum enthalpy. 

2.F.3. Helmholtz Free Energy 

For processes carried out at constant T, X, and {A;}, the Helmholtz free energy 
corresponds to a thermodynamic potential. The Helmholtz free energy, A, is 
useful for systems which are closed and thermally coupled to the outside world 
but are mechanically isolated (held at constant X). We obtain the Helmholtz free 
energy from the internal energy by adding a term due to the thermal coupling: 

A = U - ST = YX + y^n' j N j . 


(2.95) 
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The addition of -ST is a Legendre transformation which changes the 
independent variables from (S,X, {Nj}) to (T,X, {Nj}). If we take the 
differential of Eq. (2.95) and use Eq. (2.67), we find 

dA < —SdT + YdX + Yl V j dN j • (2.96) 


Therefore, 


and 




dA\ 

\dx). 


r-m 




, ( dA\ 

M,- = I — 1 


1 \dNjJ t,x,{n ¥ j } 
Again, from Eq. (2.4), we obtain Maxwell relations 


dS\ 

dXj T ,{Nj} 


dY 

dT 




W 


ay\ 

dNj) 


— © 
TMN ¥ j} / 






V0*/ r,{jv,} 


(2.97) 

(2.98) 

(2.99) 

(2.100) 

( 2 . 101 ) 

(2.102) 


and 


( d A\ = 

W7 r,X,{V ¥ ,} W T,X,{N ¥ j} 


(2.103) 


for the system. 

We can write the corresponding equations in terms of densities. Let us 
consider a monatomic substance and let a = A/ n denote the molar Helmholtz 
free energy. Then the fundamental equations for the molar Helmholtz free 
energy is a = u — sT = xY + fi. The combined first and second laws (for 
reversible processes) can be written da = —sdT + Ydx so that ( da/dT) x = —s 
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and (< da/dx) T = Y. Maxwells relations reduce to ( ds/dx) T = —{dY/dT) x . In 
Exercise 2.5, we compute that Helmholtz free energy for a monatomic ideal gas 
in terms of its natural variables. 


■ EXERCISE 2.5. Compute the Helmholtz free energy for n moles of a 
monatomic ideal gas and express it in terms of its natural variables. The 
mechanical equation of state is PV = nRT and the entropy is 
S = \nR + nR\n[(V/Vo)(no/n)(T/T 0 f 2 }. 


Answer: Since da = —sdT — Pdv we have 


and 


da\ 5 d r v /r\ 3/2 " 

— =-s = --R-Rln — ( — 

oT J v 2 vo \TqJ 


M = _ P = _?I 


dv J ■ 


a) 


( 2 ) 


If we integrate, we find a = -RT - RT\n[(v/vo)(T/To) 3/2 ] and A = 
- nRT - nmn[(V/V 0 )(n 0 /n)(r/r 0 ) 3/2 ]. 


For a YXT system, the Helmholtz free energy is a thermodynamic potential 
for reversible processes carried out at constant T, X ', and {A^}. For a change in 
the thermodynamic state of the system, the change in the Helmholtz free energy 
can be written 


AA < 


SdT + 


YdX- AW free + 




(2.104) 


where the inequality holds for spontaneous processes and the equality holds for 
reversible processes (AWfr ee is defined in Section 2.66). For a process carried 
out at fixed T, X , and {A^}, we find 


(AA) rx{JVj) < (-AM, (2-105) 

and we conclude that for a reversible process at constant T, X, and {A^}, work 
can be stored as Helmholtz free energy and can be recovered completely. 

If no work is done for a process occurring at fixed T, X , and {A^}, Eq. (2.105) 
becomes 


(AA) r x < 0. (2.106) 

Thus, an equilibrium state at fixed T, X , and (A/)} is a state of minimum 
Helmholtz free energy. 
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2.F.4. Gibbs Free Energy 

For processes carried out at constant Y, T, and {Nj}, the Gibbs free energy 
corresponds to the thermodynamic potential. Such a process is coupled both 
thermally and mechanically to the outside world. We obtain the Gibbs free 
energy, G, from the internal energy by adding terms due to the thermal and 
mechanical coupling, 


G = U-TS-XY = Y2 VjHj- (2- 107) 

i 


In this way we change from independent variables (S, X, {A^}) to variables (T,Y, 
{Ni}). If we use the differential of Eq. (2.106) in Eq. (2.67), we obtain 

dG < —SdT - XdY + ^/y#V / , (2.108) 


S = 



(2.109) 


and 



( 2 . 110 ) 



fdG\ 

\d N jJ t iY '{ N¥ .} 


(2.111) 


The Maxwell relations obtained from the Gibbs free energy are 






dS_\ 

P N jJ T,Y,{N ¥j } 





SdX\ _ /<9/A 

\dNjJ T Y ^N ¥j ) ) T,{Nj} 


and 


= fM\ 

\dNiJ T,Y,{N ¥i } \d N jJ t,y,{n ¥ j } 


(2.112) 

(2.113) 

(2.114) 


(2.115) 


and again relate seemingly diverse partial derivatives. 
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As we found in earlier sections, we can write the corresponding equations 
in terms of densities. We will consider a monomolecular substance and let 
g — Gjn denote the molar Gibbs free energy. Then the fundamental equation 
for the molar Gibbs free energy is g = u-sT-xY = p and the molar 
Gibbs free energy is equal to the chemical potential (for a monomolelcular 
substance). The combined first and second laws (for reversible processes) can 
be written dg — -sdT - xdY so that ( dg/dT) Y = -5 and ( dg/dY) T = -x. 
Maxwells relations reduce to ( ds/dY) T = +{dx/dT) Y . For a monatomic 
substance, the molar Gibbs free energy is equal to the chemical potential. 

For a YXT system, the Gibbs free energy is a thermodynamic potential for 
reversible processes carried out at constant T, Y, and {Nj}. For a change in the 
thermodynamic state of the system, the change in Gibbs free energy can be 
written 


AG < - 


SdT- XdY-AW {T ee + Y2 V i dN /> (2-116) 


where the equality holds for reversible processes and the inequality holds for 
spontaneous processes (AWf ree is defined in Section 2.F.1). For processes at 
fixed T, Z and (A)}, we obtain 


(AG) WA6} < (-AWfree). (2.117) 

Thus, for a reversible process at constant T, Y, and {Nj}, work can be stored as 
Gibbs free energy and can be recovered completely. 

For a process at fixed T, Y, and {Nj} for which no work is done, we 
obtain 


(AG) TY ^.j < 0, (2.118) 

and we conclude that an equilibrium state at fixed T, Y, and {Nj} is a state of 
minimum Gibbs free energy. 


■ EXERCISE 2.6. Consider a system which has the capacity to do work, 
= —YdX + 0W'. Assume that processes take place spontaneously so 
that dS = ( \/T)</lQ + djS, where diS is a differential element of entropy due 
to the spontaneity of the process. Given the fundamental equation for the 
Gibbs free energy, G = U - XY — TS, show that ~(dG) YT = <flW' + TdfS. 
Therefore, at fixed Y and T, all the Gibbs free energy is available to do work 
for reversible processes. However, for spontaneous processes, the amount of 
work that can be done is diminished because part of the Gibbs free energy is 
used to produce entropy. This result is the starting point of nonequilibrium 
thermodynamics. 
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Answer: From the fundamental equation for the Gibbs free energy, we 
know that dG = dU -XdY -Y dX-T dS-SdT. Also we know that 
dU = flQ + YdX- flW', so we can write dG = flQ — flW 1 — XdY— 
TdS — SdT. For fixed Y and T we have ( dG) Y T = flQ — flW' — T dS. Now 
remember that dS = ( \/T)4Q + d ( S. Then we find (dG) Y T = -fiW’ - TdiS. 
Note that the fundamental equation, G = U — XY — TS contains the starting 
point of nonequilibrium thermodynamics. 


For mixtures held at constant temperature, T, and pressure, P, the Gibbs free 
energy is a first-order homogeneous function of the particle numbers or particle 
mole numbers and this allows us to introduce a “partial” free energy, a 
“partial” volume, a “partial” enthalpy, and a “partial” entropy for each type of 
particle. For example, the chemical potential of a particle of type i, when 
written as fit = (dG/drii) T P is a partial molar Gibbs free energy. The total 
Gibbs free energy can be written 


^ V- ( dG \ 

G = Z^nita = Z^nA — 1 

fc T i=i \ an 'ST,p,M 


(2.119) 


The partial molar volume for a particle of type i is v, = (dV/dtii) T P ^ j, and 
the total volume can be written 


,= i 


niVi = 



(2.120) 


The partial molar entropy for particle of type i is s, = (dS/dn,i) T P j, and the 
total entropy can be written 


N 

S = H-iSi — 
i= 1 



( 2 . 121 ) 


Because the enthalpy is defined, H = G + TS, we can also define a partial molar 
enthalpy, hi = {dH/drii ) TP ^ ^ = //,• + Ts{. Then the total enthalpy can be 
written H = These quantities are very useful when we describe the 

properties of mixtures in later chapters. 


! ■ EXERCISE 2.7. Consider a fluid with electric potential, <f>, containing v 
different kinds of particles. Changes in the internal energy can be written, 
dU — flQ — PdV + cf)de + J2j Pjdtij. Find the amount of Gibbs free energy 
needed to bring dn t moles of charged particles of type i into the system at 
| fixed temperature, pressure, and particle number, nj(j ^ i), in a reversible 
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manner. Assume that particles of type i have a valence, z,-. Note that the 
amount of charge in one mole of protons is called a Faraday , F. 

Answer: The fundamental equation for the Gibbs free energy, 
G = U + PV - TS, yields dG = dU + PdV + VdP-TdS- SdT. There- 
fore, dG — flQ + (fide + fijdrij + V dP — TdS — SdT. For a reversible 
process, dS = (l/T)flQ, and dG = cf>de + J2j Pjdnj + V dP — SdT. Now 
note that the charge carried by dri{ moles of particles of type, i, is 
de — ZiFdrii. Thus, the change in the Gibbs free energy can be written 

dG = <f>de + VdP - SdT + ^ fijdnj 

0 ) 

— + V dP — SdT + (ziFcf) + fii)drii + ^ iijdrij. 

i+i 

For fixed P, T, and nj(j 7 ^ i), the change in the Gibbs free energy is 

0 dG )p,T,n m = ki F( t> + t*i)drii. ( 2 ) 

From Exercise 2.6, we see that this is just the work needed to add dn t moles 
of charged particles of type, i, keeping all other quantities fixed. The quantity 

k = ZiFcf) + (it, (3) 

is called the electrochemical potential. 


2.F.5. Grand Potential 

A thermodynamic potential which is extremely useful for the study of quantum 
systems is the grand potential. It is a thermodynamic potential energy for 
processes carried out in open systems where particle number can vary but T, X, 
and {//'} are kept fixed. 

The grand potential, fi, can be obtained from the internal energy by adding 
terms due to thermal and chemical coupling of the system to the outside 
world: 


Q. = U - TS - Y, Pj N J = XY • (2.122) 

j 

The Legendre transformation in Eq. (2.122) changes the independent variables 
from (5, X, {#,}) to (T,X, {Hj})- If we add the differential of Eq. (2.122) to 
Eq. (2.67), we obtain 


dCl < -SdT + YdX- EVm', 
j 


(2.123) 
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and 




fdn\ 

n j = ~ 

\ d »j 


TM^j) 
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(2.124) 

(2.125) 


(2.126) 


The Maxwell relations obtained from the grand potential are 


dS 

dX 


dY\ 


dS 

'dY 

M. 


t.W,) v )7 V 

= ( 9 JL\ 


TMn L,} 




—(&) . 

9X Jt, W > 


and 


(2.127) 

(2.128) 

(2.129) 



TXMtj} 


'dNj\ 


(2.130) 


and are very useful in treating open systems. The grand potential is a 
thermodynamic potential energy for a reversible process carried out at constant 
T, X , and {//'•}. For a change in the thermodynamic state of the system, the 
change in the grand potential can be written 


Afl < - j SdT + j Y dX - Attlee - j Njdfij, 


(2.131) 


where the equality holds for reversible changes and the inequality holds for 
spontaneous changes (AWf ree is defined in Section 2.F.1). For a process at fixed 
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T, X , and {pj}, we obtain 

( Afi )r, *,(„'} < (-iffta). (2132) 

Thus, for a reversible process at constant T, X, and {//'.}, work can be stored as 
grand potential and can be recovered completely. 

For a process at fixed T, X , and {p'-} for which no work is done, we obtain 

< 0, (2.133) 

and we find that an equilibrium state at fixed T, X, and {p'-} is a state of 
minimum grand potential. 


2.G. RESPONSE FUNCTIONS 

The response functions are the thermodynamic quantities most accessible to 
experiment. They give us information about how a specific state variable 
changes as other independent state variables are changed under controlled 
conditions. As we shall see in later chapters, they also provide a measure of the 
size of fluctuations in a thermodynamic system. The response functions can be 
divided into (a) thermal response functions, such as heat capacities, (b) 
mechanical response functions, such as compressibility and susceptibility, and 
(c) chemical response functions. We shall introduce some thermal and 
mechanical response functions in this section. 


2.G.I. Thermal Response Functions (Heat Capacity) 


The heat capacity, C, is a measure of the amount of heat needed to raise the 
temperature of a system by a given amount. In general, it is defined as the 
derivative, C = ( jlQ/dT ). When we measure the heat capacity, we try to fix all 
independent variables except the temperature. Thus, there are as many different 
heat capacities as there are combinations of independent variables, and they 
each contain different information about the system. We shall derive the heat 
capacity at constant X and {A,}, Cx,{Nj}> and we shall derive the heat capacity at 
constant Y and {A,}, Cy ^}. We will derive these heat capacities in two 
different ways, first from the first law and then from the definition of the 
entropy. 

To obtain an expression of C x ,{Nj}, we shall assume that X, T, and {A,} are 
independent variables. Then the first law can be written 


4Q = dU - YdX - rfjdNj 
j 



dT + 

X,{Nj} 



1 


-Y 


dX 


T,m 



dNj. 


(2.134) 
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For constant X and {A^}, we have [4Q] X {#.> = C x ^dT and we find 


C UNj) 



(2.135) 


for the heat capacity at constant X and {Nj}. 

To obtain an expression for CV {#.}, we shall assume that Y, T, and {A^} are 
independent variables. Then we can write 



(2.136) 


If we substitute the expression for dX into Eq. (2.134), we obtain 



(2.137) 


For constant Y and {Nj} we have [4Q\y {Nj} ~ C Y,{Nj}dT and we find 


C Y,{Nj} ~ C X,{Nj} + 



(2.138) 


for the heat capacity at constant Y and {A^}. 

For n moles of a monatomic substance, these equations simplify. Let us write 
them in terms of molar quantities. We can write the heat capacity in the form 
C X , n = (dU/dT) X n = n(du/dT) x , where u — U/n is the molar internal energy 
and x — Xjn is a molar density of the mechanical extensive variable. The molar 
heat capacity is then c x = ( du/dT) x so that C Xn — nc x . Similarly, let us note 
that (dXjdT)y n =n(dx/dT)y and (dU/dX) Tn = (du/dx) T . Therefore, the 
molar heat capacity at constant Y is c Y = c x + \{du/dx) T - Y] (dx/dT) Y . 

It is useful to rederive expressions for Cy^.} and Cypv} from the entropy. 
Let us first assume that T, X, and {Nj} are independent. Then for a reversible 
process, we obtain 


ftQ = TdS = T (— ) 


dT + T 




(© 


dX + 


U Nj} 


Yt( — 

V V9A!/ 


T,X,{N ¥j } 


dNj. 


(2.139) 
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For a processes which occurs at constant X and {A^}, Eq. (2.139) becomes 


m 


X,{Nj } 


-(I) 


dT< 




and therefore 


c x,m 




(2.140) 


(2.141) 


The second term comes from Eq. (2.97). 

Let us now assume that T, Y and {TV,} are independent. For a reversible 
process, we obtain 


4Q = TdS = 



dT + T 




dNj. 


T,Y,{N ¥ j) 

(2.142) 


If we combine Eqs. (2.136) and (2.139), we can also write 


4Q =TdS=T 


dS\ ( dS\ 

.dr) xm + \dx) Tm 



+ T 



T,{Nj } 



dY 

r,m 



(2.143) 


If we now compare Eqs. (2.142) and (2.143), we find 
C 


_ jos 
■y,m - T [gf 


Y,m 


„ rr fdS\ (dX 

- c ^ + T {Wc) Tm (df, 


Y,m 


\9T 2 J Y , {Nj} 


(2.144) 


The last term in Eq. (2.144) comes from Eq. (2.109). 

We can obtain some additional useful identities from the above equations. If 
we compare Eqs. (2.100), (2.138), and (2.144), we obtain the identity 


YdS\ _ J_ 
\<9Xy t,{Nj} T 



UNj) 






(2.145) 
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Therefore, 


d^Y\ 1 ( 9Cx,m \ 

WJxm t\ ox J Tm} ’ 


(2.146) 


where we have used Eqs. (2.4) and (2.145). 

For a monatomic substance, it is fairly easy to show that the molar heat 
capacity at constant mechanical molar density, x , is c x = T(ds/dT) x — 
-T(d 2 a/dT 2 ) x , and the molar heat capacity at constant Y is 
c Y = T{ds/dT) y = —T(d 2 a/dT 2 ) Y . We also obtain the useful identities 
(ds/d x) T = {l/T)[(du/dx) T -Y\ = ~{dY/dT) x and (d 2 Y/dT 2 ^= 
-(1 /T)(dc x /dx) T . 


2.G.2. Mechanical Response Functions 

There are three mechanical response functions which are commonly used. They 
are the isothermal susceptibility , 






T,m 


the adiabatic susceptibility, 


XS,{Nj} 




and the thermal expansivity, 


a Y,m 



Y,{Nj} 


(2.147) 


(2.148) 


(2.149) 


Using the identities in Section 2.B, the thermal and mechanical response 
functions can be shown to satisfy the identities 

Xt,{Nj}(Cy,{Nj} ~ Cx,{Nj}) = T(a Y '{ Nj }) 2 , (2.150) 

C Y,{Nj}iXT,{Nj} - XS,{Nj}} = T(&Y,{Nj}) 2 , (2.151) 

and 


C r,{^} = Xt,{Nj} 
Cx,{Nj} XS,{Nj} 


(2.152) 


The derivation of these identities is left as a homework problem. 



54 


INTRODUCTION TO THERMODYNAMICS 


For PVT systems, the mechanical response functions have special names. 
Quantities closely related to the isothermal and adiabatic susceptibilities are the 
isothermal compressibility. 


1 f dV \ _ 1 ( dlG \ 

KT ' W ~ ~V \9p)t, { n j} ~ V V^Vr -,m 


and adiabatic compressibility, 

1 ( dV \ _ 1 ( dlH \ 

Km) ~ ~v{dp) m} - V \dP*) m} 


(2.153) 


(2.154) 


respectively. The thermal expansivity for a PVT is defined slightly differently 
from above. It is 


For a monatomic PVT system the mechanical response functions become 
even simpler if written in terms of densities. The isothermal and adiabatic 
compressibilities are kt = — (1 /v)(dv/dP) T and k s = — (1 /v)(dv/dP) s , 
respectively, where u = V/n is the molar volume. The thermal expansivity is 
a P = (1 /v)(dv/dT) p . 


■ EXERCISE 2.8. Compute the molar heat capacities, c v and c P , the 
compressibilities, k t and k s , and the thermal expansivity, a P , for a 
monatomic ideal gas. Start from the fact that the molar entropy of the gas is 
s = + R\n[(v/v 0 )(T/T 0 ) 3/2 ] (v = V/N is the molar volume), and the 

mechanical equation of state is Pv = RT. 

Answer: 

(a) The molar heat capacity, c v : The molar entropy is s = §/?+ 
/?ln[(v/v 0 )(r/r 0 ) 3/2 ]. Therefore (ds/dT) v = (3R/2T) and c v = 
T{ds/dT) v = 3R/2. 

(b) The molar heat capacity, c P \ The molar entropy can be written 
5 = f R + Rln[(Po/P)(T/Tof 2 }. Then (ds/dT) p = 5R/2T and = 
T{ds/dT) p = 5R/2. 

(c) The isothermal compressibility, k t : From the mechanical equation of 
state, we have v = (RT/P). Therefore, (dv/dP) T = ~{v/P) and 
K T = -(1 /v){dv/dP) T = (1 /P). 



STABILITY OF THE EQUILIBRIUM STATE 


55 


(d) The adiabatic compressibility, k s : We must first write the molar 
volume as a function of s and P. From the expressions for the molar 
entropy and mechanical equation of state given above we find 
v = vq(Po/P) 3 ^ 5 exp[(2$/5/?) — 1]. Then {dv/dP) s = — (3v/5 P) and 
«, = -(l/v)(av/dP), = (3/5P). 

(e) Thermal Expansivity, a P : Using the mechanical equation of state, we 
find a P = (1 /v)(dv/dT) p = (1 /T). 


2.H. STABILITY OF THE EQUILIBRIUM STATE 

The entropy of an isolated equilibrium system (cf. Section 2.D.3) must be a 
maximum. However, for a system with a finite number of particles in 
thermodynamic equilibrium, the thermodynamic quantities describe the average 
behaviour of the system. If there are a finite number of particles, then there can 
be spontaneous fluctuations away from this average behaviour. However, 
fluctuations must cause the entropy to decrease. If this were not so, the system 
could spontaneously move to new equilibrium state with a higher entropy 
because of spontaneous fluctuations. For a system in a stable equilibrium state, 
this, by definition, cannot happen. 

We can use the fact that the entropy must be maximum to obtain conditions 
for local equilibrium and for local stability of equilibrium systems. We will 
restrict ourselves to PVT systems. However, our arguments also apply to 
general YXT systems. 

2.H.I. Conditions for Local Equilibrium in a PVT System 

Let us consider a mixture of l types of particles in an isolated box of volume, 
V T , divided into two parts, A and B, by a conducting porous wall which is free 
to move and through which particles can pass (cf. Fig. 2.8). With this type of 
dividing wall there is a free exchange of heat, mechanical energy, and particles 
between A and B. One can think of A and B as two different parts of a fluid (gas 
or liquid), or perhaps as a solid (part A) in contact with its vapor (part B ). We 
shall assume that no chemical reactions occur. Since the box is closed and 
isolated, the total internal energy, Up, is 


U T = J2 U «’ ( 2 - 156 ) 

ot=A,B 

where U a is the internal energy of compartment a. The total volume, V T , is 

V T = v “> 

ot=A,B 


(2.157) 
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Fig. 2.8. An isolated, closed box containing 
fluid separated into two parts by a movable 
porous membrane. 


where V a is the volume of compartment a. The total number of particles, Njj, 
of type j is 

Nj,t = Y, N i.«’ (2-158) 

a=A,B 

where N Jia is the total number of particles of type j in compartment a. The total 
entropy, S T , is 


St = S «’ (2.159) 

a=A,B 

where S a is the total entropy of compartment a. 

Let us now assume that spontaneous fluctuations can occur in the energy, 
volume, and particle number of each cell subject to the constraints 

A U T = AV t = A Njj = 0 (2.160) 

(assume that no chemical reactions occur) so that A Ua = —A Ub, A Va = 
—A Vb, and A Nj : a = —A N^b- The entropy change due to these spontaneous 
fluctuations can be written 



From Eqs. (2.58)-(2.60) and (2.160), we can write Eq. (2.161) in the form 
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where T a and P a are the temperature and pressure, respectively, of the material 
in compartment a, and //'■ is the chemical potential of particles of type j in 
compartment a. 

For a system in equilibrium, the entropy is a maximum. Therefore, any 
spontaneous changes must cause the entropy to decrease. However, AU a ,AV a , 
and ANj t A can be positive or negative. Thus, in order to ensure that ASj- < 0, 
we must have 


and 


II 

(2.163) 

II 

(2.164) 

B 1 J 1 , 

(2.165) 


Equations (2.163)-(2.165) give the conditions for local equilibrium in a system 
in which no chemical reactions occur. Thus, if the interface between A and B 
can transmit heat, mechanical energy, and particles of all types, then the two 
systems must have the same temperature, pressure, and chemical potential for 
each type of particle in order to be in equilibrium. It is important to note that if 
the partition cannot pass particles of type, i, then A N i}A = A N ijB = 0 and we 
can have /i- A ^ n' i B . If the partition is nonporous and fixed in position so no 
particles can pass and no mechanical work can be transmitted, then we can have 
Pa Pb and fij A ^ \j!- B (j = 1 , . . . , /) and still have equilibrium. 

2.H.2. Conditions for Local Stability in a PVT System [12, 13] 

Stability of the equilibrium state places certain conditions on the sign of the 
response functions. To see this, let us consider a closed isolated box with 
volume Vt, total entropy St, total internal energy Uj, and a total number of 
particles Njj of type j, where j = 1 We shall assume that the box is 
divided into M cells which can exchange thermal energy, mechanical energy, 
and particles. We shall denote the equilibrium volume, entropy, internal energy, 
and number of particles of type j for the ath cell by V° a , 5^, U J, and Nf a , 
respectively. The equilibrium pressure, temperature, and chemical potentials of 
the various cells are P°, T°, and , respectively (they must be the same for all 
the cells). 

Because there are a finite number of particles in the box, there will be 
spontaneous fluctuations of the thermodynamic variables of each cell about 
their respective equilibrium values. These spontaneous fluctuations must be 
such that Vt, and Uj, and Njj remain fixed. However, since the equilibrium 
state is stable, fluctuations must cause S T to decrease. If it did not decrease, the 
equilibrium state would be unstable and spontaneous fluctuations would cause 
the system to move to a more stable equilibrium state of higher entropy. 
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We shall assume that fluctuations about the equilibrium state are small and 
expand the entropy of the ath cell in a Taylor expansion about its equilibrium 
value 


Sa(U a ,V a ,{N Jta }) 

/rrO T / 0 r\rO 


= Sl(Ul,Vl {*"„}) + A U a + ( H)° 

\ dU J V,{Nj} \ dV J U,{Nj} 

+ ^i\dNj) 

j= i x J/ 


AN la +UA /<dSa 


U,V,{N k #} 


dS a 


. ... 


dU a 
dS a 


A U n 




K dV aJ u a ,{ Nj, a } J=l \ dN j,aJ U a ,V a ,{N k #, a } 

In Eq. (2.166), we define 


A Nj, a } + 


(2.166) 


dS a 


d 2 S\° 


Af^-J =(^) AC/ a + 

V a ,{Nj, a } \ dU ^J V,{Y,} 


d_ fds\ 


( d / dS 

j=l \ J \ / V,{Nj}/ u,V,{ Nl¥ j} 


\° 

A Nj, a . 


AV„ 


( 2 . 167 ) 


A similar expression holds for A (dS a /dV a ) Ua { N . a y For A (dS a / 
9N m)uMN kM ,„)- we have 


dS a 


d 


m 


A V a 


dN iv) U a ,V a ,{N k ^ a } \ dU \ dN jJ U,V,{N k *j}/ K{Ar . } 

(d_(dS\ \° 

+ \dvKdN i ) u , VM J v{Nj} K 

+ y'fA(^ V 

^ ft M. I ftAJ, 


i^l \ dN J/ u,v,{N k #}, 


A N tfil . (2.168) 


U,V,{N k #} 


In Eqs. (2.166)-(2.168), the superscripts 0 on partial derivatives indicate that 
they are evaluated at equilibrium. The fluctuations AU a , AV a , and A Nj ta are 
defined as AU a = U a — U%, AV a = V a — Vj, and A Nj t0 = N j>a - N® a and 
denote the deviation of the quantities U a ,V a , and Nj >a from their absolute 
equilibrium values. 
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We can obtain the total entropy by adding the contributions to the entropy 
from each of the M cells. Because the entropy can only decrease due to 
fluctuations, the terms which are first order in the fluctuations, AU a , AV a , and 
ANj, a must add to zero (this is just the condition for local equilibrium obtained 
in Section 2.H.1). Therefore, the change in the entropy due to fluctuations must 
have the form 



AU a + A 

V a ,{N ha } 



AV a 

tUtya} 



(2.169) 


Equation (2.169) can be written in simpler form if we make use of Eqs. (2.58)- 
(2.60). We than find 




A N la 




(2.170) 


or 


1 


m r 

a s T = ~^y 

a=l 


AT a AS a - AP a AV a + Afi! ja AN ha 
j= 1 


+ 


(2.171) 


In Eq. (2.171), we have used the relation TAS = A U + PAV— Ylj=i 
Equation (2.171) gives the entropy change, due to spontaneous fluctuations, in a 
completely general form. We can now expand ASj in terms of any set of 
independent variables we choose. 

Let us choose T, P, and N ; as the independent variables. Then we can 
write 




Av a = 



o 

A T a + 

Wl 



o 

A T a + 

P,{Nj} 




T,P,{N k 


A N Ua , 


(2.172) 




o 

A N j>a , 

T,P,{Nk#} 


(2.173) 
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and 


/X 0 


# /dd\ 


r,W 


M)° A Pa + y( 9 A)° 

dP /T,{Nj} f=l \ dN J T,P,{N k ^} 


AT « + Up . ... AP « + £ 


m,cr 

(2.174) 


If we now substitute Eqs. (2.172)-(2.174) into Eq. (2.171) and use the Maxwell 
relations (2. 1 12)— (2. 1 15), the entropy change becomes 

^ = [© W) (Ar “ )2 ~ 2 (^)p, m ATaAPa 


-(Z) 0 .S* P J + ±±(S) . ^AArJn- 


\ dp y T,{Nj) ;= l pi W'/ !,/>,{%} 

If we make use of Eqs. (2.6), (2.8) and (2.100), we can write 

dS_\ _/ap\ (??\ 2 

P,{Nj} v,{Nj} \dv) T,{Nj}\^ T ) P,{Nj} 


(2.175) 


(2.176) 


If we plug this into Eq. (2.175), we obtain 


1 M 

AST= -jfit 


a=l 
l l 


dS 

dT 


+EE(1) 


0 

(dP 

I (Ar 0 ) 2 

V,{Nj) 

~ (dV 

0 


m, c 

A^j,a 

T,r,{N k *} 



[AVV 2 


T,{Nj} 
+ * * * > 




(2.177) 


where 


|4VJ «> + (l ™ 

Because the fluctuations A T a , A P a , and A Nj >a are independent, the require- 
ment that A S T < 0 for a stable equilibrium state leads to the requirement that 


c v,m - T [gf 


V,{Nj) 
l l 


„ 1 fdV\ 

> 0 , K TANj} ) > 0 , 


T,{Nj} 


and EE 




'kl j^\ dN iJ T,P,{"Vi} 


ANiANj > 0. 


(2.179) 
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Conditions (2.179) are a realization of Le Chateliers’s famous principle: If a 
system is in stable equilibrium, then any spontaneous change in its parameters 
must bring about processes which tend to restore the system to equilibrium. 

The first condition in Eq. (2.179), Cy,{fy} > 0, is a condition for thermal 
stability. If a small excess of heat is added to a volume element of fluid, then the 
temperature of the volume element must increase relative to its surroundings so 
that some of the heat will flow out again. This requires that the heat capacity be 
positive. If the heat capacity were negative, the temperature would decrease and 
even more heat would flow in, thus leading to an instability. From Eqs. (2.150) 
and (2.179) we can also show that 

Cp,{Nj} > Cv,{Nj} > 0. (2.180) 


The second condition in Eq. (2.179), Kt,{Nj} > 0, is a condition for 
mechanical stability. If a small volume element of fluid spontaneously 
increases, the pressure of the fluid inside the fluid element must decrease 
relative to its surroundings so that the larger pressure of the surroundings will 
stop the growth of the volume element. This requires that the compressibility be 
positive. If the compressibility were negative, the pressure would increase and 
the volume element would continue to grow, thus leading to an instability. From 
Eqs. (2.151) and (2.179) we can show that 

k t,{Nj} > k s,{Nj} >0. (2.181) 

The third condition, J^| =1 IZ/=i(^/^f)r^>,{^} AW, AN, > 0, where AN { 
and ANj are arbitrary variations, is the condition for chemical stability. We can 
write the condition for chemical stability in matrix form: 


{AN u AN 2 ,...,m) 


( A 4 1,1 Mi ,2 ■" A*i,/ \ ( AN i \ 

an 2 


M2,l M2, 2 

V M/,i M/,2 


M2 ,/ 
M/,/ ) 


>0, (2.182) 


\mj 


where /xj f = ( dp'j/dNi) T p < Nk 1 . Because of the Maxwell relation /ij - = p\ j [cf. 
Eq. (2.115)], the matrix 


III 

la. 

($> 

M24 

;• 

Mi ,2 ' 

M2, 2 ' 

’ Mi,/ N 
■ M2 ,/ 


\ M/,i 

M/,2 ‘ 

' M/,/ / 


(2.183) 


is symmetric. In addition, in order to satisfy the condition for chemical stability, 
the matrix p' must be a positive definite matrix. A symmetric matrix is 
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positive definite if > 0 (i = and if every principal minor is positive 


■ EXERCISE 2.9. A mixture of particles, A and B, has a Gibbs free 
energy of the form 

G = n A /x%(P, T) + n B ^° B (P, T) + RTn A Info) + RTn B Info) + A , 

where n = n A + n B ,x A = n A /n, and x B = n B /n (n indicates mole number), 
fi Q A and ji° B are functions only of P and T. Plot the region of thermodynamic 
instability in the x A — T plane. 

Answer: For chemical stability, the matrix 

( /M,a ^a,b\ 

P B,A P*B,bJ 

must be symmetric positive definite. This requires that (dfi A /dn A ) p Tng > 0, 
{dfi B /dn B ) P TriA > 0, and (dfi A /dn B ) PTriA = {d^ B /dn A ) p Tng < 0. The 
chemical potential of the A-type particle is 


~ (dn~) - fi A (P,T) +RT\n(x A ) + X-~- 

\O n Aj P) T,n B n n 


A condition for thermodynamic stability is 

(P i) = — ?§- 2^ + 2^>0, (3) 

\on A J PTflB x A n 2 n 2 n 3 

or x% — x A + RT/2X > 0. For T > A/2 R, this is always satisfied. A plot of 
T = (2X/R)(x a — xfy is given below. 


critical point 



0 0.5 1-0 X A 

The shaded region corresponds to x\ — x A + RT/2X < 0 and is thermo- 
dynamically unstable. The unshaded region is thermodynamically stable. For 
T < A/2 R, two values of x A satisfy the condition x% - x A + RT/2X > 0 for 




STABILITY OF THE EQUILIBRIUM STATE 


63 


each value of T. These two values of x A lie outside and on either side of the 
shaded region and are the mole fractions of two coexisting phases of the 
binary mixture, one rich in A and the other rich in B. For T > A/2 R, only 
one value of x A satisfies the condition x% — x A + RT /2A > 0, so only one 
phase of the substance exists. (As we shall see in Chapter 3, a 
thermodynamically stable state may not be a state of thermodynamic 
equilibrium. For thermodynamic equilibrium we have the additional 
condition that the free energy be minimum or the entropy be maximum. A 
thermodynamically stable state which is not an equilibrium state is 
sometimes called a metastable state. It can exist in nature but eventually 
will decay to an absolute equilibrium state.) 


2.H.3. Implications of the Stability Requirements for 
the Free Energies 

The stability conditions place restrictions on the derivatives of the thermo- 
dynamic potentials. Before we show this, it is useful to introduce the concept of 
concave and convex functions [14]: 

(a) A function /(x) is a convex function if d 2 f(x)/dx 2 > 0 for all x (cf. 
Fig. 2.9). For any x\ and x 2 the chord joining the points f(x\) and f(x 2 ) 
lies above or on the curve /(x) for all x in the interval xi < x < x 2 . If 
df(x)/dx exists at a given point, the tangent at that point always lies 
below the function except at the point of tangency. 

(b) A function f(x) is a concave function of x if the function —f(x) is 
convex. 

We can now consider the effect of the stability requirements on the Helmholtz 
and Gibbs free energies. 



Fig. 2.9. The function f(x ) is a convex function of x. 






Fig. 2.10. (a) A plot of the Gibbs free energy and its slope as a function of pressure, (b) 
A plot of the Gibbs free energy and its slope as a function of temperature. Both plots are 
done in a region which does not include a phase transition. 
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and from Eq. (2.110) and the stability condition, Eq. (2.181), we can write 




— — Vk t ^ n .} 

T,{Nj} 


< 0 , 


(2.187) 


Thus, the Gibbs free energy is a concave function of temperature and a concave 
function of pressure. 

It is interesting to sketch the free energy and its slope as function of pressure 
and temperature. A sketch of the Gibbs free energy, for a range of pressure and 
temperature for which no phase transition occurs, is given in Fig. 2.10. The case 
of phase transitions is given in Chapter 3. 

The form of the Gibbs and Helmholtz free energies for a magnetic system is 
not so easy to obtain. However, Griffiths [15] has shown that for system of 
uncharged particles with spin, G(T, H) is a concave function of T and H and 
A(T, M ) is a concave function of T and convex function of M. In Fig. 2.1 1, we 
sketch the Gibbs free energy and its slope as a function of T and H for a 
paramagnetic system. 



Fig. 2.11. (a) A plot of the Gibbs free energy and its slope as a function of applied field, 
(b) A plot of the Gibbs free energy and its slope as a function of temperature. Both plots 
are done in a region which does not include a phase transition. 
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SPECIAL TOPICS 

► S2.A. Cooling and Liquefaction of Gases [6] 

All neutral gases (if we exclude gravitational effects) interact via a potential 
which has a hard core and outside the core a short-ranged attractive region. If 
such a gas is allowed to expand, it must do work against the attractive forces 
and its temperature will decrease. This effect can be used to cool a gas, although 
the amount of cooling that occurs via this mechanism alone is very small. We 
shall study two different methods for cooling: one based solely on free 
expansion and one which involves throttling of the gas through a porous plug or 
constriction. The second method is the basis for gas liquefiers commonly used 
in the laboratory. 


► S2.A.1. The joule Effect: Free Expansion 

Experiments which attempted to measure cooling due to free expansion were 
first performed by Gay-Lussac in 1807 and were improved upon by Joule in 
1843. The free expansion process is shown schematically in Fig. 2.12. The gas 
is initially confined to an insulated chamber with volume V t at pressure P, and 
temperature 7). It is then allowed to expand suddenly into an insulated 
evacuated chamber with volume Vf. Since the gas expands freely, no work will 
be done and AW = 0. Furthermore, since both chambers are insulated, no heat 
will be added and A Q = 0. Thus, from the first law the internal energy must 
remain constant, AI/ = 0, during the process. The only effect of free expansion 
is a transfer of energy between the potential energy and kinetic energy of the 
particles. 

Because free expansion takes place spontaneously, the entropy of the gas will 
increase even though no heat is added. During the expansion we cannot use 
thermodynamics to describe the state of the system because it will not be in 
equilibrium, even locally. However, after the system has settled down and 



Fig. 2.12. The Joule effect. Free expansion of 
a gas from an insulated chamber of volume, 
V\, into an insulated evacuated chamber of 
volume, V2, causes cooling. 



SPECIAL TOPICS: COOLING AND LIQUEFACTION OF GASES 


67 


reached a final equilibrium state, we can use the thermodynamics to relate the 
initial and final states by finding an imaginary reversible path between them. 

During the expansion the internal energy does not change and the particle 
number does not change. Thus, the internal energies and particle numbers of the 
initial and final states must be the same, and for our imaginary reversible path 
we can write 

%),*• i2 '“ i 


where n is the number of moles of gas. From Eq. (2.188), we can write 


dT = - 


\dV/T,n 

(M) 

\dT ) V,n 


dV = 



dV, 

U,n 


(2.189) 


where we have made use of the chain rule, Eq. (2.6). The quantity ( dT/dV) U n 
is called the Joule coefficient. 

Let us compute the Joule coefficient for various gases. From Eq. (2.145), we 
know that 



(2.190) 


For an ideal gas we have the equation of state, PV — nRT, and we find that 
(dU/dV) Tn = 0. Therefore, for an ideal gas the Joule coefficient, 
(dT/d V) Un = 0, and the temperature of an ideal gas cannot change during 
free expansion. 

For a van der Waals gas [cf. Eq. (2.12)], we have 


/ dU\ _ an 2 


(2.191) 


For this case, there will be a change in internal energy due to interactions as 
volume changes if temperature is held fixed. To obtain the Joule coefficient, we 
must also find the heat capacity of the van der Waals gas. From Eq. (2.146), we 
obtain 




(2.192) 


Therefore, for a van der Waals gas the heat capacity, C v , n , is independent of 
volume and can depend only on mole number and temperature: 


Cy,n = Cv,n{T,n). 


(2.193) 
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Since the heat capacity, Cy ; „, contains no volume corrections due to 
interactions, it is thermodynamically consistent to choose its value to be equal 
to that of an ideal gas (at least in the regime where the van der Waals equation 
describes a gas). If we neglect internal degrees of freedom, we can write 
Cy,n = \nR. The Joule coefficient for a van der Waals gas then becomes 


fdT\ 2 an 

\dv)u~ 3RV 2 


(2.194) 


Using Eq. (2.194), we can integrate Eq. (2.189) between initial and final states. 
We find 




(2.195) 


The fractional change in temperature is therefore 



2 an /I 1 \ 
3 * 7 ) Vi)' 


(2.196) 


If Vf > V„ the temperature will always decrease. We can use values of the van 
der Waals constant, a, given in Table 2.1 to estimate the change in temperature 
for some simple cases. Let us assume that V, = 10“ 3 m 3 , 7) = 300K, 
n = 1 mol, and Vf = oo (this will give maximum cooling). For oxygen we 
have AT & —0.037, and for carbon dioxide we obtain, AT « -0.097. We must 
conclude that free expansion alone is not a very effective way to cool a gas. 


► S2.A.2. The Joule-Kelvin Effect: Throttling 

Throttling of a gas through a porous plug or a small constriction provides a 
much more efficient means of cooling than free expansion and is the basis of 
most liquification machines. The throttling process in its simplest form is 
depicted in Fig. 2,13. A gas initially at a pressure, P„ temperature, 7), and 
volume, V,-, is forced through a porous plug into another chamber, maintained at 
pressure, Pf < P h All chambers and the plug are insulated so AQ = 0 for the 
process. The gas inside the plug is forced through narrow twisting chambers 
irreversibly. Work must be done to force the gas through the plug. Even though 
the entire process is irreversible, we can use thermodynamics to relate the initial 
and final states. 

The net work done by the gas is 

v f o 

AW = | PfdV + | PidV = PfVf - PiVt. 

0 Vi 


(2.197) 
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Fig. 2.13. The Joule-Kelvin effect. Throttling of a gas through a porous plug can cause 
cooling or heating. 


From the first law, A U = — AW since AQ = 0. Thus, 


Uf + PfV f = U i + P i V i 


(2.198) 


or, in terms of enthalpy, 


H f = Hi. (2.199) 

Thus, the throttling process is one which takes place at constant enthalpy. 

Let us now construct a hypothetical reversible path to describe the constant 
enthalpy process. For each differential change along the reversible path, we 
have (assuming the total particle number remains constant) 

[dH) n = 0 = TdS+ VdP. (2.200) 

We see that the increase in entropy due to the throttling process is accompanied 
by a decrease in pressure. It is convenient to use temperature and pressure as 
independent variables rather than entropy and pressure. We therefore expand 
the entropy 



(2.201) 


and obtain 


[dH} n = 0 = C Pin dT + 




(2.202) 


In Eq. (2.202) we have used Eqs. (2.144) and (2.112). Equation (2.202) can be 
rewritten in the form 


dP, 


H,n 


dT = 


er 

dP 


(2.203) 
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where (dT fdP) Hn is the Joule-Kelvin coefficient and is defined 


Vjk 


( dT\ = _ (f) r , w = J_ L (dV\ 

\d P ) H,n (| f)p,« Cp ' n . \® T /P,n 


(2.204) 


Let us now compute the Joule-Kelvin coefficient for various gases. For an ideal 
gas, ( dV/dT) Pn = ( V /T ) and therefore the Joule-Kelvin coefficient, 
(dT/dP) Hn , equals 0. There will be no temperature change during the 
throttling process for an ideal gas. Furthermore, since 7) = 7} for ideal gases, 
PfVf = PiVi and no net work will be done (AW = 0). If work had been done on 
or by the gas, we would expect a temperature change since the process is 
adiabatic. 

For a van der Waals gas, assuming that Cv, n = \ nR, we find 



2a 

RT 



5 3 a fv — ZA 2 | 

2 “ RTv V v ) j ’ 


(2.205) 


where v = V/n is the molar volume. Equation (2.205) is straightforward to 
obtain from the righthand term in Eg. (2.204) and from Egs. (2.12), (2.144), and 
(2.100). For an interacting gas, such as the van der Waals gas, the Joule-Kelvin 
coefficient can change sign. This is easiest to see if we consider low densities so 
that RTv » a and v > b. Then 


(dT\ ~±\2a_.} 
\dPj„~5R[RT j' 


(2.206) 


For low temperatures ( dT/dP) H n > 0, and gases cool in the throttling process, 
but at high temperatures, we have (dT / dP) H n < 0, and they heat up. Two 
effect determine the behaviour of the Joule-Kelvin coefficient. On the one hand, 
the gas expand, which gives rise to cooling. On the other hand, work can be 
done on or by the gas. If > P/Vf, then net work is done on the gas, which 
causes heating. If < P/Vf, then net work is done by the gas, which causes 
cooling. 

The inversion temperature (the temperature at which the sign of hjk 
changes) for the Joule-Kelvin coefficient will be a function of pressure. Since 
Cp n > 0, the condition for inversion [from Eq. (2.204)] is 



or, for a van der Waals gas [cf. Eq. (2.205)], 


v — b 


= b. 


(2.207) 


2 a 
RT 


v 


2 


(2.208) 
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We can use the van der Waals equation, (2.12), to write Eq. (2.208) in terms of 
pressure and temperature. First solve Eq. (2.208) for v as a function R, T, a, and 
b, and substitute into the van der Waals equation. This gives 


P 


2 

b 


flaRT 

3 RT 

a 

/ b 

2b 

b 2 


(2.209) 


The inversion curve predicted by the van der Waals equation has the shape 
of a parabola with a maximum at 7^ = %a/9bR. For C0 2 , 7^ = 911 K 
while the experimental value [15] is 7^ = 1500 K. For H 2 , 7^ = 99 K while 
the experimental value [15] is 7^ = 202 K. In Fig. 2.14, we plot the van der 
Waals and the experimental inversion curves for N 2 . The van der Waals 
equation predicts an inversion curve which lies below the experimental curve 
but qualitatively has the correct shape. For nitrogen at P = 10 5 Pa, hjk = 
1.37 x 10~ 7 K/Pa at 7 = 573K, \i JK = 1.27 x 10" 6 K/Pa at T = 373K, 
/ijk = 6.40 x 10 -6 K/Pa at 7=173K, and \l jk = 2.36 x 10~ 5 K/Pa at 
T = 93 K. (For experimental value of fijK for other substances, see the 
International Critical Tables [5].) We see that the cooling effect can be quite 
large for throttling. 

A schematic drawing of a liquefaction machine which utilizes the Joule- 
Kelvin effect is shown in Fig. 2.15. Gas is precooled in a vessel, A, below its 
inversion temperature and expanded through a small orifice into vessel, B, thus 
causing it to cool due to the Joule-Kelvin effect. The cooled gas is allowed to 
circulate about the tube in B so that the gas in the tube becomes progressively 


PfPaxlO 7 ) 



Pig. 2.14. A plot of the inversion temperature versus pressure for the Joule-Kelvin 
coefficient of N 2 . The solid line is the experimental curve [6]. The dashed line is the 
curve predicted by the van der Waals equation for a = 0.1408 Pa. M 6 /mol 2 and 
b = 0.03913m 3 /mol. 
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A B 



Fig. 2.15. Schematic drawing of a gas liquefier using the Joule-Kelvin effect. 


cooler before expanding through the orifice. The process is run continuously 
and eventually the gas liquefies and collects below. 

At times the Joule-Kelvin effect can lead to serious difficulties. For example, 
highly compressed H2, which has a low inversion temperature, can ignite 
spontaneously when leaking from a damaged container, because of Joule- 
Kelvin heating. 


► S2.B. Entropy of Mixing and the Gibbs Paradox [6] 

If entropy is a measure of disorder in a system, then we expect that an entropy 
increase will be associated with the mixing of distinguishable substances, since 
this causes an increase in disorder. As we shall now show, this does indeed 
occur even for ideal gases. 

Let us consider an ideal gas containing n\ moles of atoms of type A 1 , «2 
moles of atoms of type A2, . . . , and n m moles of atoms of type A m in a box of 
volume V and held at temperature T. The equation of state for the ideal gas 
mixture can be written 


p=J2 p ” ( 2 - 210) 

7=1 

where Pj is the pressure of the j th component (partial pressure) and is defined as 
follows: 


njRT 

P) = J — 


V 


( 2 . 211 ) 



SPECIAL TOPICS: ENTROPY OF MIXING AND THE GIBBS PARADOX 


73 


The mole fraction of atoms of type Aj is defined as follows: 


E m 

i=l n i 


p ' 


( 2 . 212 ) 


The change in the Gibbs free energy for an arbitrary change in the variables, 
P iT , n \, ... , n m , is 


dG=-SdT+VdP + Yl^ dn P {2.213) 

j= i 

where \lj is the chemical potential and S, V, and fij are given by Eqs. (2.109)- 
(2.111), but with mole number replacing particle number. 

Let us now find the entropy of mixing for a mixture of distinguishable 
monatomic ideal gases. The Gibbs free energy of n moles of a monatomic ideal 
gas is 


Ft 5 / 2 ] 

G{T,P,n) = -nRT lnl — + G (0) , (2.214) 

where G^ is a constant [cf. Eq. (2.107) and Exercise 2.3]. We will first consider 
a box held at temperature T and pressure P partitioned into m compartments, 
and let us assume the following: compartment 1 contains n\ moles of atoms of 
type A\ at pressure P and temperature T ; compartment 2 contains «2 moles of 
atoms of type A 2 at pressure P and temperature T ; and so on. The compartments 
are separated by walls that can transmit heat and mechanical energy, so the 
pressure and temperature is uniform throughout the system. The Gibbs free 
energy of the system is the sum of the free energies of each compartment and 
can be written 


G/(P, r, n m ) = - ^2 n J RT ln 

j= 1 


7 ^ 5/2 


+ g; 


( 0 ) 


(2.215) 


where G^ is a constant. If we now remove the partitions and let the gases mix 
so that the final temperature and pressure are T and P, the Gibbs free energy of 
the mixture will be 


G F (P,T,n u . .. ,n m ) = - ^njRT ln 


;= 1 


j5/2 


Pj j 


+ Gf 


= G,(P,T,m,...,n m ) + J2 nj RT ln (xj) + Gf } - Gf } , 

j= 1 


(2.216) 



74 


INTRODUCTION TO THERMODYNAMICS 


where Gp is a constant and we have used the relation Pj = Pxj. The change in 
the Gibbs free energy during the mixing process is therefore 

m 

A G = G f -Gi = YP njRT In (*/) + G^ 0) - g{ 0) . (2.217) 

j= i 

From Eqs. (2.109) and (2.217), the increase in entropy due to mixing is 

m 

ASmix = ~J2 n J R I" (*/)• (2.218) 

If xj = 1 (one compartment and one substance), then ASmix = 0, as it should be. 
If there are two compartments, each containing one mole, then x\ = ( 1 /2) and 
X 2 = (1/2) and AS m i X = 2R ln(2) and the entropy increases during mixing. 

It is important to note that Eq. (2.218) contains no explicit reference to the 
type of particles in the various compartments. As long as they are different, 
mixing increases the entropy. However, if they are identical, Eq. (2.218) tells us 
that there will still be an increase in entropy when the partitions are removed, 
even though the concept of mixing loses its meaning. Clearly, Eq. (2.218) does 
not work for identical particles. This was first noticed by Gibbs and is called the 
Gibbs paradox. The resolution of the Gibbs paradox lies in quantum mechanics, 
as we shall see when we come to statistical mechanics, (cf. Chapter 7). Identical 
particles must be counted in a different way from distinguishable particles (they 
have different “statistics”). This difference between identical and distinguish- 
able particles persists even in the classical limit and leads to a resolution of the 
Gibbs paradox. 


► S2.C. Osmotic Pressure in Dilute Solutions 

Each spring, when the weather begins to warm up, sap rises in the trees, and the 
yearly cycle of life starts again. The rising of sap is one of many examples in 
biological systems of the phenomenon called osmosis. We can easily 
demonstrate the effect in the laboratory. Let us take a beaker of water and 
immerse a long tube (open at both ends) in it. The water levels of the tube and 
of the beaker will be the same. Next, close off the bottom end of the tube with a 
membrane which is permeable to water but not sugar. The water levels will still 
be the same in the tube and the beaker. Now add a bit of sugar to the water in 
the tube. Water will begin to enter the tube through the membrane, and the level 
of the sugar solution will rise a distance h above the level of the water in the 
beaker (cf. Fig. 2.16). The excess pressure created in the tube, n = p s hg, is 
called the osmotic pressure ( p s is the density of the sugar solution and g is the 
acceleration of gravity). After equilibrium is reached, the pressure on the side of 
the membrane with sugar solution will be greater than that on the water side, by 
a factor 7r. The membrane must sustain the unbalanced force. 
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sugar solution 


semipermeable membrane 


Fig. 2.16. The osmotic pressure of the sugar solution is 7r = p s hg, where p s is the 
density of the sugar solution and g is the acceleration of gravity. 



It is instructive to show the same phenomenon in another way (cf. Fig. 2.17). 
We will consider a system consisting of pure water, separated by a permeable 
(to water) membrane from a solution of sugar and water. The entire system is 
kept at a fixed temperature T, and the membrane is rigid. At equilibrium, there 
will be an imbalance in the pressures of the two sides. If Pq is the pressure of 
the pure water, than the sugar solution will have a pressure P = Pq + 7r, where 
7r is the osmotic pressure. This imbalance of pressures is possible because the 
membrane is rigid and cannot transmit mechanical energy. Since the water is 
free to move through the membrane, the chemical potential of the pure water 
must be equal to the chemical potential of the water in the sugar solution. 

Let us first write the thermodynamic relations for this system. First consider 
the sugar solution. A differential change in the Gibbs free energy, G — 
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G(P , T,n w ,n s ), of the sugar solution (with n w moles of water and n s moles of 
sugar solution) can be written 

dG = —S dT + V dP + fi w dn w -j- fi s dn s , (2.219) 

where S = -(dG/dT) Pnw , ns is the entropy of the solution, V = (dG/dP) T nw fls 
is the volume of the solution, and fi w = {dG/dn w ) P T ris and fi s = {dG/dn s ) P j n [ 
are the chemical potentials of the water and sugar, respectively, in the solution. 
The chemical potentials are intensive and depend only on ratios n s /n w . It is 
convenient to introduce mole fractions 


Xw — 


n w + n s 1 + n s /n v 


and jtc = 


n s /n Y 


n w + n s 1 + n s /n y 


( 2 . 220 ) 


Since x w + x s = 1 , the chemical potentials can be written as a function of mole 
fraction, x s . Thus, fi w = n w (P, T,x s ) and fi s = fi s (P, T,x s ). 

At equilibrium the chemical potentials of the pure water and the water in the 
sugar solution will be equal. If we let fiffl(Po, T) denote the chemical potential 
of the pure water, we can write 


^(P 0 ,T) = ^(P,T,x s ) (2.221) 


as the condition for thermodynamic equilibrium. 

We want to obtain an expression for the osmotic pressure in terms of 
measurable quantities. We will simplify properties of the solution as much as 
possible. We will assume that the solution is dilute so that n s /n w < 1 and 
x s ~ n s /n w >C 1. We can construct a fairly simple model to describe the 
solution. We write the Gibbs free energy of the solution in the form 


G(P, T, n„ n w ) = n w ^ 0> (P, T) + n.pf 1 (P, T) - \ ^ 

n (2.222) 

+ n w RT In (x w ) + n 5 /?rin (x 5 ). 

The chemical potential contains contributions to the Gibbs free 

energy due to the presence of water (sugar) molecules and due to self- 
interactions. The term —A (n s n w /n) gives the contribution to the free energy due 
to interactions between sugar and water molecules. The last two terms on the 
right give contributions to the free energy due to mixing [cf. Eq. 3.57], The 
chemical potential of the water in the solution can now be written 


MP, T,x.) = (~) = /4 0) ( P , T) - Xx 1 , + RT In (1 - *,), (2.223) 

\on w j P T ris 


where fiffl(P,T) is the chemical potential of pure water at pressure P and 
temperature T. For a dilute solution, x s — n s /n <§: 1 and ln(l — x s ) = 
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— x s - (l/2)x 2 — • • • . Thus, to lowest order in x s = n s /n, we find 

Hw(P, T,X.) « vg>(P, T) - x s RT (2.224) 

for the chemical potential of water in a dilute sugar solution. 

We now can find an expression for the osmotic pressure, n = P — Pq. Let us 
note that water, as well as most liquids, is very incompressible. The 
compressibility, kt, of water at 0°C is kj — 4.58 x 10 -11 cm 2 /dyne. Therefore 
the quantity {d^/dP) Triw = (dV°/dn w ) TP = v® (v° is the partial molar 
volume of water in the absence of solute and V° is the volume of water in the 
absence of solute) remains approximately constant for small changes in 
pressure. With this observation we can integrate (d^ v /dP) T to find 

A(P, T) - /£(/>„, T) » v° (P - P 0 ) = v° w n. (2.225) 

Let us now assume that the change in the volume of water as we increase the 
number of moles is proportional to the number of moles so that U° = n w v®. 
Also, for very small concentrations of solute, we can assume that the change in 
the volume of water due to the presence of the solute is negligible so that 
V°«V, where V is the volume of the mixture. Then we can combine Eqs. 
(2.221), (2.224), and (2.225) to obtain 


7 r « 


n s RT 

V 


(2.226) 


Equation (2.226) is called van ’t Hoff’s law and, surprisingly, looks very much 
like the ideal gas law, although we are by no means dealing with a mixture of 
ideal gases. Equation (2.226) is well verified for all dilute neutral solvent-solute 
systems. 


■ EXERCISE 2.10. An experiment is performed in which the osmotic 
[ pressure of a solution, containing n suc moles of sucrose (C 12 H 22 O 11 ) and 1 kg 
! of water (H 2 0), is found to have the following values [2]: (a) for 
n S uc = 0.1 , 7T = 2.53 x 10 5 Pa, (b) for n suc = 0.2, tt = 5. 17 x 10 5 Pa, and (c) 

! f°r n suc = 0.3, 7r = 7.81 x 10 5 Pa. Compute the osmotic pressure of this 
system using van’t Hoff’s law. How do the computed values compare with 
the measured values? 

Answer: The molecular weight of water (H 2 0) is M H2 o = 18 gr/mol. 
Therefore, 1 kg of water contains 55.56 mol of water. The molar volume of 
water is vh 2 o = 18 x 10 _6 m 3 /mol. The osmotic pressure of the solution, 
according to van’t Hoff’s law, is 

n suc (8.317 J/K)(303 K) 

I ^ ~ 55.56 18 x 10 _6 m 3 /mol 
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The computed values are as follows: (a) For n suc = 0.1, n = 2.52 x 10 5 Pa, 
(b) for n suc — 0.2, n = 5.04 x 10 5 Pa, and (c) for n suc = 0.3, 7r = 7.56 x 
10 5 Pa. The predictions of van’t Hoff’s law are good for a dilute solution of 
sucrose in water, but begin to deviate as the mole fraction of sucrose 
increases. 


► S2.D. The Thermodynamics of Chemical Reactions 
[16-18] 

Chemical reactions occur in systems containing several species of molecules 
(which we will call A, B, C, and D), which can transform into one another 
through inelastic collisions. A typical case might be one in which molecules A 
and B can collide inelastically to form molecules C and D. Conversely, 
molecules C and D can collide inelastically to form molecules A and B. The 
collisions occur at random and can be either elastic or inelastic. To be inelastic 
and result in a reaction, the two molecules must have sufficient energy to 
overcome any potential barriers to the reaction which might exist. Chemical 
equilibrium is a dynamical state of the system. It occurs when the rate of 
production of each chemical species is equal to its rate of depletion through 
chemical reactions. The chemical reactions themselves never stop, even at 
equilibrium. 

In the early part of this century a Belgium scientist, de Donder, found that it 
was possible to characterize each chemical reaction by a single variable £, 
called the degree of reaction. In terms of £, it is then possible to determine when 
the Gibbs free energy has reached its minimum value (chemical reactions 
usually take place in systems with fixed temperature and pressure) and therefore 
when the chemical system reaches chemical equilibrium. It is important to 
notice that the concept of degree of reaction assumes that we can generalize the 
concept of Gibbs free energy to systems out of equilibrium. 


► S2.D.1. The Affinity 

Let us consider a chemical reaction of the form 


k\ 

—u a A — vbB ^ ucC + vdD. (2.227) 

k 2 

The quantities v a ,vb, v c, and v D are called stoichiometric coefficients', vj is the 
number of molecules of type j needed for the reaction to take place. By 
convention, v A and vb are negative. The constant k\ is the rate constant for the 
forward reaction, and ki is the rate constant for the backward reaction. The rate 
constants give the probability per unit time that a chemical reaction takes place. 
If the rate constants are known, then one can find the rate of change in the 
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number of each type of molecule involved in the chemical reaction. For 
example, the rate of change in the number, N A , of molecules of type A is given 
by 


dNA 

dt 


= -kiN^ Al N l B al + k 2 N l c cl N l " Dl , 


(2.228) 


where \v A \ denotes the absolute value of v A . Equation (2.228) reflects the fact 
that \v A \ molecules of A and \v B \ molecules of B must collide to destroy A 
molecules while \v c \ molecules of C and \v D \ molecules of D must collide to 
create A molecules. 

Let us now assume that initially there are n A = —v A nQ moles of A, 
n B = — z'flWo + Nb moles of B, nc = ocn' Q m °l es of C, and rip = v D n' Q + No 
moles of D. The reaction to the right will be complete when 

n A =0, n B = N b , n c = o c (n 0 + n 0 ), n D = o D (n 0 + n' 0 ) + N D . 


We next define the degree of reaction by the equation 

4 = («o + »;)+-• (2-229) 

v A 

As we have defined it, £ has the units of moles. In terms of f, the number of 
moles of each substance can be written 


n A = —o A (no -f n' 0 ) 4- v A £, 

(2.230) 

nB = ~VB(no + n' 0 ) + N B + o B £, 

(2.231) 

nc = vet,. 

(2.232) 


and 


no — vd£ + Nj). (2.233) 

Any changes in the concentrations due to the reaction can therefore be written 

dn A = v A d£, dn B = o B d^ dnc — v C d£, dn D = v D d^ (2.234) 
or 


dn A dnB _ dnc _ dno 
v A ob vc od 


(2.235) 


Equations (2.234) and (2.235) are very important because they tell us that any 
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changes in the thermodynamic properties of a system due to a given reaction 
can be characterized by a single variable. 

From Eq. (2.108), differential changes in the Gibbs free energy may be 
written 

m m 

dG = —S dT + V dP + ^ Hjdrij = —SdT + V dP + 'ZmK (2-236) 
j= i 

where the sum is over the species which participate in the reaction. Therefore 

(f)„ -f>' < “ j ” 

and the quantity 

m 

A = ^2 fjjUj (2.238) 

;=i 

is called the affinity (in some books the affinity is defined with an opposite sign). 
At chemical equilibrium, the Gibbs free energy must be a minimum, 

(S)l =A ° =o (2 - 239) 

(the superscript 0 denotes equilibrium) and, therefore, at chemical equilibrium 
the affinity must be zero. 

We can easily find the sign of the affinity as the system moves toward 
chemical equilibrium from the left or right. At constant P and T, the Gibbs free 
energy, G, must always decrease as the system moves toward chemical 
equilibrium (at equilibrium G is a minimum). Therefore, 

[dG]p - T= (M)pT d(<0 ' (2 ' 240) 

If the reaction goes to the right, then d£ > 0 and A < 0. If the reaction goes 
to the left, then d£ < 0 and A > 0. This decrease in the Gibbs free energy is 
due to spontaneous entropy production resulting from the chemical reactions 
(see Exercise 2.6). 

If there are r chemical reactions in the system involving species, j, then there 
will be r parameters, £*, needed to describe the rate of change of the number of 
moles, nf. 

r 

dnj = ^2 VjkdZk- 
k= 1 


(2.241) 
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Table 2.2. Values of the Chemical Poten- 
tial, p°, for Some Molecules in the Gas 
Phase at Pressure P 0 = 1 atm and Tem- 
perature Tq = 298 K 


Molecule 

p° 

(kcal/mol) 

h 2 

0.00 

HI 

0.31 

h 

4.63 

n 2 

0.00 

no 2 

12.39 

nh 3 

-3.98 

n 2 0 4 

23.49 


The sum over k is over all chemical reactions in which molecules of type j 
participate. 

Using ideal gas laws, some useful relations can be obtained for reactions in 
the gas phase. Consider a gas composed of four different kinds of molecules (A, 
B, C, and D) which undergo the reaction in Eq. (2.227). If the partial pressure of 
the ith constituent is P„ the chemical potential of the ith constituent can be 
written 


m(P h T) = ~ RT\n\(j^ ' ^1, (2.242) 

where To) is the chemical potential of the ith constituent at pressure Po 

and temperature To. Values of /z?, with Po = 1 atm and Po = 298 K, have been 
tabulated for many kinds of molecules [20]. A selection is given in Table 2.2. If 
we use Eq. (2.242), the Gibbs free energy can be written 


G (T,P,0 = r °) - Ys niR T ln 


a 


5/2 


= £n i /4’(Po,r 0 )-£>*r In 


n 5/ Vfo 

To) \P 



+ RT lnW4‘x" c V D °], 


(2.243) 
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and the affinity can be written 


A(T,P,0 = J2w. 

i i 

= Y i v i tf(Po, T o) -Y^ViRTin 

i i 




+ RT In 


VC y vd 
c C X D 


,Wa\Jvb\ 

L X A X B J 


(2.244) 


where P = Yli Pi * s the pressure and T is the temperature at which the reaction 
occurs. 

For “ideal gas reactions” the equilibrium concentrations of the reactants can 
be deduced from the condition that at equilibrium the affinity is zero, A 0 = 0. 
From Eq. (2.243) this gives the equilibrium condition 


In 


PC 


XAX 


C D 


>a| Jvb\ 

C A X B J 



where Pq — 1 atm and 7b = 298 K. Equation (2.245) is called the law of mass 
action. As we shall show in Exercise 2.1 1, we can use it to compute the value of 
the degree of reaction, and therefore the mole fractions, at which chemical 
equilibrium occurs as a function of pressure and temperature. 


^ S2.D.2. Stability 


Given the fact that the Gibbs free energy for fixed P and T is minimum at 
equilibrium, we can deduce a number of interesting general properties of 
chemical reactions. First, let us note that at equilibrium we have 


and 



(2.246) 


d 2 G\° 

. / P,T 


> o. 

ddpj 


(2.247) 


Equations (2.246) and (2.247) are statements of the fact that the Gibbs free 
energy, considered as a function of P, T, and £, is minimum at equilibrium for 
fixed T and P. 
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From the fundamental equation, H = G + TS, we obtain several important 
relations. First, let us note that at equilibrium 


fdH\° _ /as\° 
\Ot)p- T \dt) PT 


(2.248) 


[we have used Eq. (2.246)]. Thus, changes in enthalpy are proportional to the 
changes in entropy. The left-hand side of Eq. (2.248) is called the heat of 
reaction. It is the heat absorbed per unit reaction in the neighborhood of 
equilibrium. For an exothermic reaction, ( dH/d£)° PT is negative. For an 
endothermic reaction, (dH/d£)° pr is positive. From Eq. (2.109), Eq. (2.248) can 
be written 


(3L 


-T 


d ^ dT 'rt\e,r 


-T 


d_(dG\ 

dr\dt) 


P,T 




(2.249) 


For an “ideal gas reaction,” we can use Eqs. (2.244) and (2.249) to obtain an 
explicit expression for the heat of reaction. We find 


(dH\° _ 5 

\a^) PT ~2 


2> ffr +£ 

i i 


UjRT In 



-RT In 


PCj'D 


XAX 


C D 


A 4 A 


B J 


(2.250) 


If the total number of particles changes during the reaction (]>£ i/ ( ^ 0), there 
will be contributions to the heat of reaction from two sources: (1) There will be 
a change in the heat capacity of the gas due to the change in particle number, 
and (2) there will be a change in the entropy due to the change in the mixture of 
the particles. If the total number of particles remains unchanged (]T\ i/, = 0), 
the only contribution to the heat of reaction will come from the change in the 
mixture of particles (assuming we neglect changes to the heat capacity due to 
changes in the internal structure of the molecules). 

Let us now obtain some other general properties of chemical reactions. From 
the chain rule [Eq. (2.6)] we can write 


(r)A\ ( 

_ \dT)p£ _ 1 \ d t)p,T 

V<9r ) PA ( m') t ( 8a\ 

p t j 


(2.251) 


The denominator in Eq. (2.251) is always positive. Thus, at equilibrium any 
small increase in temperature causes the reaction to shift in a direction in which 
heat is absorbed. 
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Let us next note the Maxwell relation 


dA\ _ fdV\ 


dPJre \OtJ 


pj 


[cf. Eqs. (2.236) and (2.238)]. It enables us to write 


m (f) P ,r 

\dP ) TA ( dA ( cM\ 

) p T \ / p t 


(2.252) 


(2.253) 


At equilibrium an increase in pressure at fixed temperature will cause the 
reaction to shift in a direction which decreases the total volume. 


■ EXERCISE 2 . 11 . Consider the reaction 

N 2 0 4 ^ 2N0 2 , 

which occurs in the gas phase. Start initially with 1 mol of N 2 0 4 and no 
N0 2 . Assume that the reaction occurs at temperature T and pressure P. Use 
ideal gas equations for the chemical potential, (a) Compute and plot the 
Gibbs free energy G(T, P, £), as a function of the degree of reaction, £, for (i) 
P = 1 atm and T = 298 K and (ii) P = 1 atm and T = 596K. (b) Compute 
and plot the affinity, A(T , P, £), as a function of the degree of reaction, £, for 
(i) P = 1 atm and T = 298 K and (ii) P = 1 and T = 596 K. (c) What is the 
degree of reaction, £, at chemical equilibrium for P = 1 atm and temperature 
T = 298 K? How many moles of N 2 0 4 and N0 2 are present at equilibrium? 
(d) If initially the volume is Vo, what is the volume at equilibrium for P — 1 
atm and T = 298 K? (e) What is the heat of reaction for P — 1 atm and 
T = 298 K? 

Answer: The number of moles can be written /tn 2 o 4 = 1 — £ and /ino 2 = 2£. 
The mole fractions are 


•*n 2 o 4 = 


1 zA 

i+f 


*no 2 = 


2£ 

1+f 


(1) 


(a) The Gibbs free energy is 

G(T,P)=Yl n ‘M P O' r ») - E"' STI " [ (£) 5/ ' 2 (y 


+ /?7Tn 


(1-£) (1 ' {, (2 0 K) ' 


(2) 


(1 + f) 


(1+0 
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where i =(N 2 04, N0 2 ). From Table 2.2, for Pq = 1 atm and 
7o = 298 K, A^n 2 o 4 = 23.49 kcal/mol and /i^,o 2 — 12.39 kcal/mol. 
Plots of G(T,P) are given below. Chemical equilibrium occurs for 
the value of £ at the minimum of the curve. 




0.2 0 . 4 - 


0.8 1.0 i 


P = Po 


(b) The affinity is 

MT,P)=Y^ To) - ViRT In [ (0 ' (^) 

‘ RT J ^ 2 ' ] ^ 

[(1-0(1+?)]' 

Plots of A(T,P) are given in the figures. 

(c) Chemical equilibrium occurs for the value of £ at which A — 0. From 
the plot of the affinity, the equilibrium value of the degree of reaction 
is £ eq « 0.166. Thus, at equilibrium n N2 o 4 = 0.834 and n N o 2 = 0.332. 
At equilibrium the mole fractions are x Nz o 4 = (0.834/1.166) = 0.715 
and x N o 2 = (0.332/1.166) = 0.285. 

(d) Initially there are n N2 o 4 — 1 mol of N 2 0 4 and jc N o 2 — 0 m °l of N0 2 
and a total of 1 mol of gas present. At chemical equilibrium, there are 
«n 2 o 4 = 0.834 mol of N 2 0 4 and n N o 2 = 0.332 mol of NO 2 and a total 
of 1.166 mol of gas present. The reaction occurs at temperature 7b and 
pressure Pq. Therefore, the initial volume is Vo = ((l)/?7b/Po) and 
the final volume is V = (( \.166)RT 0 /P 0 ) = I.I66V0. 

(e) The heat of reaction for the reaction occurring at temperature To and 
pressure Pq is 

5 5 Fm 


§)° = §*r 0 -*r„>„ ^ 

0’s / p t T 4 -*N 2 0 4 

= 4.68 RTq. 


5 (0.285) 2 

-RTq — RTq In -L- — 

2 u [ 0.715 J 
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► S2.E. The Thermodynamics of Electrolytes [19-21] 

A very important phenomenon in biological systems is the flow of charged 
ions in electrically neutral solutions. Of particular interest is the behaviour 
of dilute solutions of salt (the solutes), such as NaCl, KC1, or CaCl 2 , in water 
(the solvent). If we denote the negative ion (the anion) as A - and the positive 
ion (the cation) as C + , the dissociation of the salt into charged ions can be 
denoted 


A~C+ c ^v a A- + v c C + (2.254) 

(e.g., CaCl 2 ^ 2Cl~-fCa + ), where v a and v c are the stoichiometric coefficients 
for the dissociation. The condition for equilibrium is 

Vac = Vatfa + V c p? c , (2.255) 

where p e a (p e c ) is the electrochemical of ion, A~ (C + ) and p ac is the chemical 
potential of the undissociated salt. Electrical neutrality of the fluid requires that 

v a z a + v c Zc = 0, (2.256) 

where z a e{z c e ) is the charge of the ion, A~ (C + ), and e is the charge of an 
electron. It is important to note that the mixture of a salt, such as NaCl, and 
water must be dealt with as a two-component system consisting of NaCl 
molecules and H 2 0 molecules. The salt can dissociate, but the numbers of 
anions and cations are not independent of one another. However, a 
thermodynamic framework can be set up to deal with the ions as separate 
entities, and that is what we will describe below. 

The chemical potential of the salt in aqueous solution is extremely 
complicated, but experiments show that it can be written in the form 

»ac(P,T,x ac ) = $ C (P,T) + RT ln(cw), (2.257) 

where a ac is called the activity and will be defined below, and p Q ac (P, T ) is the 
chemical potential of the salt in aqueous solution at temperature T and pressure 
P in the limit of infinite dilution. p Q ac (P, T) is proportional to the energy needed 
to add one salt molecule to pure water. We now define a reference value for the 
Gibbs free energy to be 

G°(P , 7\ n m ,n w ) = «*/&(/>, T) + n„n° w (P, T), (2.258) 

where n ac {n w ) is the number of moles of salt (water) molecules in the system of 
interest and (P, T) is the chemical potential of pure water. In principle, we 
can obtain a numerical value for G°. The difference between this reference 
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Gibbs free energy and the actual Gibbs free energy can be written 

G G — n ac (fi ac Mac) ^h'(Mw aO = n ac RT\n ( ot ac ) + n w ([i w f^ w )- 

(2.259) 

We can now relate the activity, a ac , for the salt molecule to activities for the 
ions. We define 


Otar = CCOO 


(2.260) 


Then 


In (aac) = v a In (a a ) + v c In (a c ). 

(2.261) 

The condition for equilibrium, Eq. (2.255), is satisfied if we 
electrochemical potentials of the ions to be 

define the 

A = P° a (P,T))+RT\n(a a ) + z a F<l> 

(2.262) 

and 


= P° c (P,T) + RT\n(a c ) + z c F(j>, 

(2.263) 


where fi° ac = + /i®, and we use the condition for charge neutrality. Here 0 is 

the electric potential, F is a Faraday, and z, is the charge of the ith ion (i = a, b). 

The quantities a a and a c are defined to be the activities of the anion and 
cation, respectively. It is found experimentally that in the limit of infinite 
dilution, a a = f a c a and a c = f c c c , where c a and c c are the concentrations 
(moles/volume) of the anions and cations, respectively. The quantities f c and f c 
are called activity coefficients. In the limit c, —> 0,f —> 1 (/ = a, c). Solutions 
for which f c = 1 and f a = 1 are said to be ideal. For ideal solutions the 
electrochemical potentials of the ions can be written 

^ a =^ a (P,T)+RT\n(c a )+ZaH (2-264) 

and 

M? = H° C (P, T) + RT]n (c c ) + z c F <t>. (2.265) 

Because of these relationships, we can write changes in the Gibbs free energy 
as 


dG = —S dT +VdP + Habdn a b + [i w dn w 

= — S dT + V dP + fJL a dn a + fi e c dn c + fj> w dn w , 


(2.266) 
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where dn a — v a dn ac and dn c — v c dn ac . Therefore, changes in the Gibbs free 
energy can be expressed either in terms of the salt or in terms of the ions. In 
Exercise 2.12, we give some examples of how these quantities can be used. 


■ EXERCISE 2.12. Consider a vessel held at constant temperature T and 
pressure P, separated into two disjoint compartments, I and II, by a 
membrane. In each compartment there is a well-stirred, dilute solution of a 
solute and a solvent. Assume that the membrane is permeable to the solute, 
but not permeable to the solvent. Compute the ratios of the concentrations of 
solute in the two compartments for the following two cases, (a) The solute is 
uncharged, but the solvents in the two compartments are different, (b) The 
solute is charged and the fluids in the two compartments are maintained at 
different electric potentials, but the solvents are the same. 


(?(kcal) 



Answer: 


(a) Denote the chemical potential of the solute in compartment I (II) as 
(/i J ) / ((/x 5 ) // ). Since the solute is uncharged, the chemical potential can 
be written fj, s = ^ + /?Tln (c s ), where (i Q s is the chemical potential of 
the solute in the solvent in the limit of infinite dilution, and c s is the 
concentration of the solute. The condition for equilibrium of the 
solute in the two compartments is 

(/*?)/ + RT\n (c') = (tf),, + RTln (cf). (1) 


Thus, at equilibrium the ratio of the concentrations of solute in the 
two compartments is 


^ = GXP 


(A#)// - (M?)/ 


RT 


(2) 
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The ratio (5 S = c^/cf is called the partition coefficient. It is a measure 
of the different solubility of solute in the two solvents. 

(b) Since the solute is charged, the chemical potential can be written 
p s = p? s + RT In (c 4 ) + where 0 is the electric potential and z s is 
the charge of the solute particles. Since the solvents are the same, the 
condition for equilibrium of the solute in the two compartment is 

RT la (c') + ZsP<t>i = RT In (cf ) + z,Hn- (3) 


Thus, at equilibrium the ratio of the concentrations of solute in the 
two compartments is 


— = exp 


z,FA<A 


RT 




(3) 


where A 4> = 4>n — 4>i- Conversely, the potential difference for a given 
ratio of concentrations is 

A* = *„-*,= gin (|). (4) 

The potential difference, A cj>, needed to maintain the concentration 
difference is called the Nemst potential. 
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PROBLEMS 

Problem 2.1. Test the following differentials for exactness. For those cases in which the 
differential is exact, find the function u(x,y). 

(aMu a =^ + ^. 

(b) du b = (y - x 2 )dx + (x + y 2 )dy. 

(c) du c = (2 y 2 — 3x)dx — Axy dy. 

Problem 2 . 2 . Consider the two differentials (1) du\ = (2xy + x 2 )dx + x 2 dy and (2) 
du 2 = y(x - 2 y)dx — x 2 dy. For both differentials, find the change in u(x,y) between two 
points, (a,b) and (x,y). Compute the change in two different ways: (a) Integrate along 
the path (a,b) — > ( x,b ) — > ( x,y ), and (b) integrate along the path ( a,b ) — >• ( a,y ) — > 
(x,y). Discuss the meaning of your results. 

Problem 2.3. Electromagnetic radiation in an evacuated vessel of volume V at 
equilibrium with the walls at temperature T (black body radiation) behaves like a gas of 
photons having internal energy U = aVT A and pressure P = (1/3 )aT 4 , where a is 
Stefan’s constant, (a) Plot the closed curve in the P-V plane for a Carnot cycle using 
blackbody radiation, (b) Derive explicitly the efficiency of a Carnot engine which uses 
blackbody radiation as its working substance. 

Problem 2.4. A Carnot engine uses a paramagnetic substance as its working substance. 
The equation of state is M = ( nDH/T ), where M is the magnetization, H is the 
magnetic field, n is the number of moles, D is a constant determined by the type of 
substance, and T is the temperature, (a) Show that the internal energy U, and therefore 
the heat capacity Cm, can only depend on the temperature and not the magnetization. 
Let us assume that Cm = C = constant, (b) Sketch a typical Carnot cycle in the M-H 
plane, (c) Compute the total heat absorbed and the total work done by the Carnot 
engine, (d) Compute the efficiency of the Carnot engine. 
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Fig. 2.18. 


Problem 2.5. Find the efficiency of the engine shown in Fig. 2.18. Assume that the 
operating substance is an ideal monatomic gas. Express your answer in terms of Vi and 
Vj- (The processes 1 — » 2 and 3 — > 4 are adiabatic. The processes 4 — ► 1 and 2 — > 3 
occur at constant volume.) 

Problem 2.6. One kilogram of water is compressed isothermally at 20 °C from 1 atm to 
20 atm. (a) How much work is required? (b) How much heat is ejected? Assume that the 
average isothermal compressibility of water during this process is kt = 0.5 x 10 4 /atm 
and the average thermal expansivity of water during this process is ap = 2 x 10~ 4 /°C. 

Problem 2.7. Compute the efficiency of the heat engine shown in Fig. 2.19. The engine 
uses a rubber band whose equation of state is J = aLT, where a is a constant, J is the 
tension, L is the length per unit mass, and T is the temperature in Kelvins. The specific 
heat (heat capacity per unit mass) is a constant, cl = c. 



Fig. 2.19. 
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Problem 2.8. Experimentally one finds that for a rubber band 



where J is the tension, a — 1.0 x 10 3 dyne/K, and L 0 — 0.5 m is the length of the band 
when no tension is applied. The mass of the rubber band is held fixed, (a) Compute 
(dL/dT) j and discuss its physical meaning, (b) Find the equation of state and show that 
dJ is an exact differential, (c) Assume that the heat capacity at constant length is 
Cl = 1 .OJ/K. Find the work necessary to stretch the band reversibly and adiabatically to 
a length of 1 m. Assume that when no tension is applied, the temperature of the band is 
T — 290 K. What is the change in temperature? 

Problem 2.9. Blackbody radiation in a box of volume V and at temperature T has 
internal energy U = aVT 4 and pressure P = (1/3 )aT 4 , where a is the Stefan-Boltzmann 
constant, (a) What is the fundamental equation for blackbody radiation (the entropy)? 
(b) Compute the chemical potential. 

Problem 2.10. Two vessels, insulated from the outside world, one of volume V\ and the 
other of volume V 2 , contain equal numbers N of the same ideal gas. The gas in each 
vessel is orginally at temperature 7). The vessels are then connected and allowed to 
reach equilibrium in such a way that the combined vessel is also insulated from the 
outside world. The final volume is V = V\ + V 2 . What is the maximum work, A Wfr ee , 
that can be obtained by connecting these insulated vessels? Express your answer in 
terms of 7), Vi , V 2 , and N. 


Problem 2.11. For a low-density gas the virial expansion can be terminated at first 
order in the density and the equation of state is 


P = 


Nk B T 


N 

1 +yB 2 (T) 


where B 2 (T) is the second virial coefficient. The heat capacity will have corrections to 
its ideal gas value. We can write it in the form 

^ 3 u , A j 2 k B m 

Cv,n = 2 Nk B y~ F \ T )- 

(a) Find the form that F(T) must have in order for the two equations to be 
thermodynamically consistent, (b) Find Cps- (c) Find the entropy and internal energy 


Problem 2.12. Prove that 



Problem 2.13. Compute the entropy, enthalpy, Helmholtz free energy, and Gibbs free 
energy of a paramagnetic substance and write them explicitly in terms of their natural 
variables when possible. Assume that the mechanical equation of state is m— (DH/T) 
and that the molar heat capacity at constant magnetization is c m = c, where m is the 
molar magnetization, H is the magnetic field, D is a constant, c is a constant, and 71s the 
temperature. 
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Problem 2.14. Compute the Helmholtz free energy for a van der Waals gas. The 
equation of state is ( P + {an 2 /V 2 )){V - nb) = nRT, where a and b are constants which 
depend on the type of gas and n is the number of moles. Assume that the heat capacity is 
C V n = (3/2 )nR. Is this a reasonable choice for the heat capacity? Should it depend on 
volume? 

Problem 2.15. Prove that (a) Kp{Cp - Cy) = TVaj, and (b) (Cp/Cy) = (kj/ks)- 

Problem 2.16. Show that Tds = c x (dT/dY) x dY + CY(dT/dx) Y dx, where x = X/n is 
the amount of extensive variable, X, per mole, c x is the heat capacity per mole at 
constant x, and c Y is the heat capacity per mole at constant Y. 

Problem 2.17. Compute the molar heat capacity cp, the compressibilities kt and k s , 
and the thermal expansivity ap for a monatomic van der Waals gas. Start from the fact 
that the mechanical equation of state is P = ( RT /(v — b)) - {a/v 2 ) and the molar heat 
capacity is c v = 3/?/2, where v = V/n is the molar volume. 

Problem 2.18. Compute the heat capacity at constant magnetic field C// >n , the 
susceptibilities % 7 > and xs,n, and the thermal expansivity oh,« for a magnetic system, 
given that the mechanical equation of state is M = nDH/T and the heat capacity is 
Cm,« = nc, where M is the magnetization, H is the magnetic field, n is the number of 
moles, D is a constant, c is the molar heat capacity, and T is the temperature. 

Problem 2.19. A material is found to have a thermal expansivity ap = (R/Pv)+ 
(a/RT 2 v) and an isothermal compressibility k t = (l/v)(Tf(P) + {b/P)), where 
v — {V/n) is the molar volume, (a) Find f{P). (b) Find the equation of state, (c) 
Under what conditions is this material stable? 

Problem 2.20. Compute the efficiency of the reversible two heat engines in Fig. 2.20. 
Which engine is the most efficient? (Note that these are not Carnot cycles. The 
efficiency of a heat engine is ij = A W tota i / A Q absorbed • ) 

Problem 2.21. It is found for a gas that kt = Tvf{P) and ap = ( Rv/P ) + ( Av/T 2 ), 
where T is the temperature, v is the molar volume, P is the pressure, A is a constant, and 
f{P) is an unknown function of P. (a) What is /(F)? (b) Find v = v(P, T ). 

Problem 2.22. A monomolecular liquid at volume Vl and pressure P L is separated from 
a gas of the same substance by a rigid wall which is permeable to the molecules, but 
does not allow liquid to pass. The volume of the gas is held fixed at Vq, but the volume 




Fig. 2.20. 
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of the liquid can be varied by moving a piston. If the pressure of the liquid is increased 
by pushing in on the piston, by how much does the pressure of the gas change? [Assume 
the liquid is incompressible (its molar volume is independent of pressure) and describe 
the gas by the ideal gas equation of state. The entire process occurs at fixed temperature 
I] 

Problem S2.1. Consider a gas obeying the Dieterici equation of state, 

nRT / na \ 

~ ( V - nb) 6Xp V VRT ) ’ 

where a and b are constants, (a) Compute the Joule coefficient, (b) Compute the Joule- 
Kelvin coefficient, (c) For the throttling process, find an equation for the inversion curve 
and sketch it. What is the maximum inversion temperature? 

Problem S2.2. An insulated box is partitioned into two compartments, each containing 
an ideal gas of different molecular species. Assume that each compartment has the same 
temperature but different number of moles, different pressure, and different volume [the 
thermodynamic variables of the ith box are (P,-, r,n,-, V,)]. The partition is suddenly 
removed and the system is allowed to reach equilibrium, (a) What are the final 
temperature and pressure? (b) What is the change in the entropy? 

Problem S2.3. Two containers, each of volume V, contain ideal gas held at temperature 
T-and pressure P. The gas in chamber 1 consists of N\ <a molecules of type a and N\ t b 
molecules of type b. The gas in chamber 2 consists of N 2<a molelcules of type a and 
molecules of type b. Assume that N\ a + N^ b — N 2 ^ a +# 2 ,b- The gases are allowed to 
mix so the final temperature is T and the final pressure is P (cf. Fig. 2 . 21 ). (a) Compute 
the entropy of mixing, (b) What is the entropy of mixing if N\ )U = N 2 , a and N\ h = N 2t b- 
(c) What is the entropy of mixing if N\ a = N 2} b and N\ h = N 2 a = 0 . Discuss your 
results for (b) and (c). 

Problem S2.4. An insulated box with fixed total volume V is partitioned into m 
insulated compartments, each containing an ideal gas of a different molecular species. 
Assume that each compartment has the same pressure but a different number of moles, a 
different temperature, and a different volume. (The thermodynamic variables for the ith 
compartment are (P, n,, 7), Vi).) If all partitions are suddenly removed and the system is 
allowed to reach equilibrium: (a) Find the final temperature and pressure, and the 
entropy of mixing. (Assume that the particles are monatomic.) (b) For the special case 
of m = 2 and parameters n\ = 1 mol, T\ = 300 K, V\ = 1 liter, n 2 = 3 mol, and V 2 = 
2 liters, obtain numerical values for all parameters in part (a). 




Fig. 2.21. 
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Problem S2.5. A tiny sack made of membrane permeable to water but not NaCl 
(sodium chloride) is filled with a 1% solution (by weight) of NaCl and water and is 
immersed in an open beaker of pure water at 38 °C at a depth of 1 ft. (a) What osmotic 
pressure is experienced by the sack? (b) What is the total pressure of the solution in the 
sack (neglect surface tension)? Assume that the sack is small enough that the pressure of 
the surrounding water can be assumed constant. (An example of such a sack is a human 
blood cell.) 

Problem S2.6. A solution of particles A and B has a Gibbs free energy 

1 tP" 1 

G(P, T, n A , n B ) =n A g A (P, T ) + n B g B (P, T)+- A ^ — + - A B b — 

Z n Z n 

+ A as + n A RT In (* A ) + n B RT In (x B ). 

n 

Initially, the solution has n A moles of A and n B moles of B. (a) If an amount, A n B , of B is 
added keeping the pressure and temperature fixed, what is the change in the chemical 
potential of A? (b) For the case A^ = \ BB = X AB , does the chemical potential of A 
increase or decrease? 

Problem S2.7. Consider the reaction 

2HI ^ 2H 2 + 1 2 , 

which occurs in the gas phase. Start initially with 2 mol of HI and 0 mol each of H 2 and 
I 2 . Assume that the reaction occurs at temperature T and pressure P. Use ideal gas 
equations for the chemical potential, (a) Compute and plot the Gibbs free energy, 
G(T, P, £), as a function of the degree of reaction, £, for (i) P — 1 atm and T = 298 K 
and (ii) P = 1 atm and T = 596 K. (b) Compute and plot the affinity, A(T, P, £), as a 
function of the degree of reaction, £, for (i) P = 1 atm and T = 298 K and (ii) P = 1 atm 
and T — 596 K. (c) What is the degree of reaction, £, at chemical equilibrium for P = 
1 atm and temperature T — 298 K? How many moles of HI, H 2 , and I 2 are present at 
equilibrium? (d) If initially the volume is Vo, what is the volume at equilibrium for 
P = 1 atm and T = 298 K? (e) What is the heat of reaction for P = 1 atm and 
T = 298 K? 

Problem S2.8. Consider the reaction 

2NH 3 ^ N 2 + 3H 2 , 

which occurs in the gas phase. Start initially with 2 mol of NH 3 and 0 mol each of H 2 
and N 2 . Assume that the reaction occurs at temperature T and pressure P. Use ideal gas 
equations for the chemical potential, (a) Compute and plot the Gibbs free energy, 
G(T, P, £), as a function of the degree of reaction, £, for (i) P = 1 atm and T = 298 K 
and (ii) P = 1 atm and T = 894 K. (b) Compute and plot the affinity, A(T,P,£ ), as a 
function of the degree of reaction, £, for (i) P = 1 atm and T = 298 K and (ii) P = 1 atm 
and T = 894 K. (c) What is the degree of reaction, £, at chemical equilibrium for P = 
1 atm and temperature T = 894 K? How many moles of HI, H 2 , and I 2 are present at 
equilibrium? (d) If initially the volume is Vo, what is the volume at equilibrium for 
P — 1 atm and T = 894 K? (e) What is the heat of reaction for P = 1 atm and 
T = 894 K? 
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THE THERMODYNAMICS OF 
PHASE TRANSITIONS 


3.A. INTRODUCTORY REMARKS 

A thermodynamic system can exist in a number of different phases whose 
macroscopic behavior can differ dramatically. Generally, systems become more 
ordered as temperature is lowered because forces of cohesion begin to 
overcome thermal motion and atoms can rearrange themselves into more 
ordered states. Phases changes occur abruptly at some critical temperature, 
although evidence that one will occur can be found on a macroscopic scale as 
the critical temperature is approached. In this chapter we will be concerned with 
the thermodynamics of phase transitions — that is, the description of phase 
transitions in terms of macroscopic variables. In later chapters we shall study 
them from a microscopic point of view. 

The first step in trying to understand the phase changes that can occur in a 
system is to map out the phase diagram for the system. At a transition point, two 
(or more) phases can coexist in equilibrium with each other. The condition for 
equilibrium between phases is obtained from the equilibrium conditions derived 
in Chapter 2. Since phases can exchange matter and energy, equilibrium occurs 
when the chemical potentials of the phases become equal for given values of Y 
and T. From the equilibrium condition, we can determine the maximum number 
of phases that can coexist and, in principle, find equations for the regions of 
coexistence (the Clausius-Clapeyron equation). 

At a phase transition the chemical potentials of the phases, and therefore the 
Gibbs free energy, must change continuously. However, phase transitions can be 
divided into two classes according the behavior of derivatives of the Gibbs free 
energy. Phase transitions which are accompanied by a discontinuous change of 
state (discontinuous first derivatives of the Gibbs free energy with respect to 
temperature and displacement) are called first-order phase transitions. Phase 
transitions which are accompanied by a continuous change of state (but 
discontinuous higher-order derivatives) are called continuous phase transitions. 
We give examples of both in this chapter. 

Classical fluids provide some of the most familiar examples of first-order 
phase transitions. The vapor-liquid, vapor-solid, and liquid-solid transitions 
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are all first order. We shall discuss the phase transitions in classical fluids in 
some detail. For the vapor-solid and vapor-liquid transitions, we can use the 
Clausius-Clapeyron equation to find explicit approximate equations for the 
coexistence curves. Since the vapor-liquid transition terminates in a critical 
point, we will focus on it and compare the observed behavior of the vapor- 
liquid coexistence region to that predicted by the van der Waals equation. 

Superconductors and superfluids are especially interesting from the 
standpoint of thermodynamics because they exhibit both first-order and 
continuous phase transitions and they provide a test for the third law of 
thermodynamics. In the absence of a magnetic field, the transition from a 
normal to a superconducting state in a metal is a continuous phase transition. It 
is a phase transition which is purely quantum mechanical in nature and results 
from a macroscopic condensation of pairs of electrons into a single quantum 
state. The superfluid transitions in liquid He 3 and liquid He 4 are of similar 
quantum origin. The superfluid transitions in liquid He 3 involve pairs of 
“dressed” He 3 atoms which condense, on a macroscopic scale, into a single 
quantum state. In liquid He 4 a macroscopic number of “dressed” He 4 atoms 
condense into the ground state. When liquid He 3 and liquid He 4 are mixed 
together, they form a binary mixture which can undergo both a continuous 
superfluid phase transition and a first-order binary phase transition. 

Most phase transitions have associated with them a critical point (the liquid- 
solid transition does not). There is a well-defined temperature above which one 
phase exists and, as the temperature is lowered, a new phase appears. When the 
new phase appears, it often has different symmetry properties and a new 
thermodynamic variable, the order parameter, appears which characterizes the 
new phase. For first-order phase transitions there need not be a connection 
between the symmetries of the high-temperature phase and those of the low- 
temperature phase. For continuous phase transitions, there is always a well- 
defined connection between the symmetry properties of the two phases. 
Ginzburg and Landau developed a completely general theory of continuous 
symmetry-breaking phase transitions which involves an analytic expansion of 
the free energy in terms of the order parameter. We shall discuss the Ginzburg- 
Landau theory in this chapter and show how it can be applied to magnetic 
systems at the Curie point and to superfluid systems. 

The critical point plays a unique role in the theory of phase transitions. As a 
system approaches its critical point from the high temperature side, it begins to 
adjust itself on a microscopic level. Large fluctuations occur which signal the 
emergence of a new order parameter which finally does appear at the critical 
point itself. At the critical point, some thermodynamic variables can become 
infinite. Critical points occur in a huge variety of systems, but regardless of the 
particular substance or mechanical variable involved, there appears to be a great 
similarity in the behaviour of all systems as they approach their critical points. 
One of the best ways to characterize the behavior of systems as they approach 
the critical point is by means of critical exponents. We shall define critical 
exponents in this chapter and give explicit examples of some of them for the 
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liquid-vapor transition in simple fluids. At the end of the chapter in the special 
topics section we define critical exponents for the Curie point. 

The section on special topics contains further applications of thermo- 
dynamics at phase transitions and at the interface of different phases. In Chapter 
2 we derived conditions for mechanical thermodynamic equilibrium between 
two parts of a system in equilibrium. If the system consists of two phases, such 
as a liquid and a gas phase, then the interface between the two phases may be 
under tension due to an imbalance of molecular forces at the interface. If the 
interface is under tension, the condition for mechanical equilibrium must be 
modified to include mechanical effects due to the surface tension. 

A rather spectacular phenomenon which occurs in superfluid systems is the 
thermomechanical, or fountain, effect. As we shall see, it is possible to use a 
heat source to drive a superfluid fountain. Even though superfluids are highly 
degenerate quantum systems, the fountain effect surprisingly can be described 
in terms of classical thermodynamics. All we need is a system composed of two 
interpenetrating fluids, one of which carries no entropy. Then simple arguments 
give us the fountain effect. 

A binary mixture of molecules in the fluid state provides a different example 
of a first-order phase transition. For that system, below a certain critical 
temperature we can have a physical separation of the mixture into two fluids, 
each of which is rich in one of the types of molecules. 

Finally we conclude the special topics section with the application of 
Ginzburg-Landau theory to superconductors in the presence of a magnetic field. 


3.B. COEXISTENCE OF PHASES: 

GIBBS PHASE RULE 

Most systems can exist in a number of different phases, each of which can 
exhibit quite different macroscopic behavior. The particular phase that is 
realized in nature for a given set of independent variables is the one with the 
lowest free energy. For certain values of the independent variables, two or more 
phases of a system can coexist. There is a simple rule, called the Gibbs phase 
rule, which tells us the number of phases that can coexist. Generally, coexisting 
phases are in thermal and mechanical equilibrium and can exchange matter. 
Under these conditions, the temperature and chemical potentials of the phases 
must be equal (cf. Section 2.H) and there will be another condition between 
mechanical variables expressing mechanical equilibrium. For example, for a 
simple PVT system, the pressures of the two phases may be equal (if surface 
tension can be neglected). 

For simplicity, let us first consider a YXT system which is pure (composed of 
one kind of particle). For a pure system, two phases, I and //, can coexist at a 
fixed value of Y and T if their respective chemical potentials are equal: 

M '(y,T) = M"( y,T). 


(3.1) 
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(The chemical potentials are functions only of intensive variables.) Equation 
(3.1) gives a relation between the values of Y and T for which the phases can 
coexist, 


Y = Y(T), (3.2) 

and in the Y-T plane it defines a coexistence curve for the two phases. If the 
pure system has three phases, I, II, and III, they can only coexist at a single 
point in the Y-T plane (the triple point). Three coexisting phases must satisfy 
the equations 


p'(y, T) = »"(Y, T) = T ). (3.3) 

Since we have two equations and two unknowns, the triple point is uniquely 
determined. For a pure system, four phases cannot coexist, because we would 
then have three equations and two unknowns and there would be no solution. 

As an example of the Gibbs phase rule for pure substances, we show the 
coexistence curves for various solid phases of water (cf. Fig. 3.1). We see that 
although water can exist in many different solid phases, no more than three 
phases can coexist at a given temperature and pressure. 

For a mixture of l different types of particles, at most l -F 2 phases can 
coexist. To show this, we note that if there are l types of particles in each phase, 
then there will be / + 1 independent variables for each phase, namely, 
(Y, T,x\, . . . , jc/_ i ) , where jc, is the mole fraction of particles of type i. If we 
have several phases coexisting, the chemical potential for a given type of 



Fig. 3.1. Coexistence curves for the solid and liquid phases of water. In accordance with 
the Gibbs phase rule, no more than three phases can coexist [1]. (Based on Ref. 2.) 
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particle must have the same value in each of the various phases. Thus, if there 
are r coexisting phases at a given value of Y and T, the condition for equilibrium 
is 


rf(Y, T,x[,4, ■ ■ ■ .*/-,) = /*"(!', T,x[‘,x", . . . ,*/',) = • • ■ (3.4) 

= /^T, Arf-i)- 

l4(Y, T,x(, . . .,*{_,) = i4'(Y, T,x\>, . . . ,*",) = ■ ■ • (3.5) 

= ^(7, r.zf,..., *£.,). 

p,'(K. T,x[, . . . ,*/_,) = tf(Y, T,x[ . . . ,*",) = • • ■ (3.6) 

= Mi(Y, T,Xi,...,x[_ t ). 

Equations (3.4)-(3.6) give Z(r — 1) equations to determine 2 + K/-1) 
unknowns. For a solution, the number of equations cannot be greater than the 
number of unknowns. Thus, we must have l(r - 1) <2 + r(l - 1) or r< / + 2. 
The number of coexisting phases must be less than or equal to / + 2, where l is 
the number of different types of particle. For a pure state, l — 1 and r ^ 3 as we 
found before. For a binary mixture, l = 2 and r<4, and, at most, four different 
phases can coexist. 


3.C. CLASSIFICATION OF PHASE TRANSITIONS 

As we change the independent intensive variables (Y, T,x\ , . . . ,xi) of a system, 
we reach values of the variables for which a phase change can occur. At such 
points the chemical potentials (which are functions only of intensive variables) 
of the phases must be equal and the phases can coexist. 

The fundamental equation for the Gibbs free energy, in a system with / 
different kinds of molecules, is 


/ 

G = 2>,Mi, (3.7) 

7=1 

where rij is the number of moles of the jth constituent, and /i ; - is its chemical 
potential (cf. Section 2.F.4). For processes which occur at constant Y and T, 
changes in the Gibbs free energy can be written 

/ 

[dG] Y j = ) ^ [ijdrij. (3.8) 

7=1 

Thus, at a phase transition, the derivatives /i ; = (dG/drij) Y T ^ j must be equal 
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Fig. 3.2. Typical behavior for the Gibbs free energy at a first-order phase transition. 


and the Gibbs free energy of each phase has the same value. However, no 
restriction is placed on the derivatives X = — (dG/dY) T ^ n .y and S = 
-(dG/dT)y { n .y The behavior of these derivatives is used to classify phase 
transitions. If the derivatives (dG/dY) T ^ n .y and ( dG/dT) Y ^ j are discontin- 
uous at the transition point (that is, if the extensive variable X and the entropy S 
have different values in the two phases), the transition is called “first-order.” If 
the derivatives (dG/dT) Y ^ and (dG/dY) T ^ are continuous at the transition 
but higher-order derivatives are discontinuous, then the phase transition is 
continuous. (The terminology “nth-order phase transition” was introduced by 
Ehrenfest to indicate a phase transition for which the nth derivative of G was the 
first discontinuous derivative. However, for some systems, higher-order 
derivatives are infinite, and the theory proposed by Ehrenfest breaks down 
for those cases.) 

Let us now plot the Gibbs free energy for first-order and continuous 
transitions in a PVT system. For such a system the Gibbs free energy must be a 
concave function of P and T (cf. Section 2.H.3). The Gibbs free energy and its 
first derivatives are plotted in Fig. 3.2 for a first-order phase transition. A 
discontinuity in (dG/dP) T ^ j. means that there is the discontinuity in the 
volume of the two phases, 1 
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Fig. 3.3. Typical behavior for the Gibbs free energy at a continuous phase transition. 
The heat capacity can exhibit a peak at the transition. 


and a discontinuity in (dG/dT) p ^ n ,y means there is a discontinuity in the 
entropy of the two phases, 




AS = S 1 - S n 


PMj) 


dG 

dT 


(3.10) 
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Since the Gibbs free energy is the same for both phases at the transition, the 
fundamental equation H = G -1- TS shows that the enthalpy of the two phases is 
different, 


AH = H ! - H u = FAS, (3.11) 

for a first-order phase transition. The enthalpy difference, AH, is also called the 
latent heat. 

For a continuous phase transition, the Gibbs free energy is continuous but its 
slope changes rapidly. This in turn leads to a peaking in the heat capacity at the 
transition point. An example is given in Fig. 3.3. For a continuous transition, 
there is no abrupt change in the entropy or the extensive variable (as a function 
of Y and F) at the transition. 

In the subsequent sections we shall give examples of first-order and 
continuous phase transitions. 


3.D. PURE PVT SYSTEMS [3-5] 

A pure PVT system is a system composed of only one type of molecule. 
The molecules generally have a repulsive core and a short-range attractive 
region outside the core. Such systems have a number of phases: a gas phase, 
a liquid phase, and various solid phases. A familiar example of a pure 
PVT system is water. Below we shall examine the properties of phase 
transitions in such systems. Since we deal with pure substances in this 
section, it is convenient to describe their phase transitions in terms of molar 
densities. 

3.D.I. Phase Diagrams 

A typical set of coexistence curves for pure substances is given in Fig. 3.4 (Note 
that Fig. 3.4 does not describe the isotopes of helium, He 3 or He 4 , which have 
superfluid phases, but it is typical of most other pure substances.) Point A on the 
diagram is the triple point, the point at which the gas, liquid, and solid phases 
can coexist. Point C is the critical point, the point at which the vaporization 
curve terminates. The fact that the vaporization curve has a critical point means 
that we can go continuously from a gas to a liquid without ever going through a 
phase transition, if we choose the right path. The fusion curve does not have a 
critical point (none has ever been found). We must go through a phase transition 
in going from the liquid to the solid state. This difference between the gas- 
liquid and liquid-solid transitions indicates that there is a much greater 
fundamental difference between liquids and solids than between liquids and 
gases, as one would expect. The major difference lies in their symmetry 
properties. Solids exhibit spatial ordering, while liquids and gases do not. (We 
will use “vapor” and “gas” interchangeably.) 
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Fig. 3.4. Coexistence curves for a typical pure PVT system. Point A is the triple point 
and point C is the critical point. The dashed line is an example of a fusion curve with 
negative slope. 



Fig. 3.5. A plot of the coexistence regions for a typical PVT system. All the phase 
transitions here are first order. The dashed lines represent isotherms. 


The transitions from gas to liquid phase, from liquid to solid phase, and from 
gas to solid phase are all first-order transitions and are accompanied by a latent 
heat and a change in volume. In Fig. 3.5, we have drawn the phase diagram in 
the P—v plane. The dashed lines are lines of constant temperature. We notice 
that the slope of the dashed lines is negative, (dP/dv) T < 0. This is a statement 
of the stability condition, kj > 0 (cf. Section 2.H.2). In the region ot 
coexistence of phases, the isotherms (dashed lines) are always flat, indicating 
that in these regions the change in volume occurs for constant P and T. 

It is interesting to plot the pressure as a function of molar volume and 
temperature in a three-dimensional figure. As we can see in Fig. 3.6, the result 
is similar to that of a mountain with ski slopes. The height of the mountain at 
any given value of v and T is the pressure. Fig. 3.6 actually corresponds to a plot 
of the equation of state for the pure system. The shaded region is the region of 
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Fig. 3.6. Three-dimensional sketch of the equation for the typical pure PVT system. 


coexistence of more than one phase. Figure 3.4 is a projection of Fig. 3.6 on the 
P-T plane and Fig. 3.5 is a projection of Fig. 3.6 on the P-v plane. 

3.D.2. Coexistence Curves: Clausius-Clapyron Equation 

The molar Gibbs free energies (the chemical potentials), g — G/n, of two 
coexisting phases (which we call / and II) of a pure PVT system must be equal. 
If we change the pressure and temperature at which the two phases coexist (that 
is, if we move to a new point on the coexistence curve), the molar Gibbs free 
energy of the two phases must change by equal amounts. Thus, dg 1 = dg n 
along the coexistence curve. We can use this fact to find an equation for the 
coexistence curve. We use Eq. (2.108) to write 

v 1 dP - s 1 dT = v 11 dP - s 11 dT (3.12) 

along the coexistence curve, where v is the molar volume and s is the molar 
entropy. Thus, 


dP\ s 7 — s 11 As 

dT )coex V 7 -V /7 Av 


(3.13) 


along the coexistence curve, where As = s 1 — s 11 is the difference in the molar 
entropy of the two phases and Av = v 7 — v /7 is the difference in the molar 
volume of the two phases. Equation (3.13) can also be written in terms of the 
latent heat, A h = T As [cf. Eq. (3.11)] so that 


Ah 


dP' 

dT 


coex 


T Av' 


(3.14) 
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Equation (3.14) is called the Clausius-Clapeyron equation. The latent heat. Ah, 
is the heat absorbed per mole in the transition from phase II to phase I. It is of 
interest to discuss the Clausius-Clapeyron equation for the three coexistence 
curves in Fig. 3.4. 


■ Exercise 3.1. Prove that the latent heat must always be positive (heat is 
absorbed) when making a transition from a low-temperature phase to a high- 
temperature phase. 



I Answer: Let us assume that phase / is the high-temperature phase and 
I phase II is the low-temperature phase. Since for fixed pressure and 
temperature the equilibrium state is a state of minimum Gibbs free energy, 
we must have G/ < G// above the transition temperature and Gj > Gu be- 
low the transition temperature. As the drawing indicates, this implies that 
(dGi/dT) p ^ < (dGn/dT) P { n .} both above and below the transition 
temperature/ Therefore Si = —(dGi/dT) p ^ > Sn = — ( dGu/dT ) p ^ 
and AS = TAH is always positive in going from the low-temperature 
phase to the high-temperature phase. 


3.D.2.a. Vaporization Curve 

If we evacuate a chamber and partially fill it with a pure substance, then for the 
temperatures and pressures along the coexistence curve (the vaporization curve) 
from the triple point to the critical point (point A to point C in Fig. 3.4) the 
vapor and liquid phases will coexist in the chamber. For a given temperature T, 
the pressure of the vapor and liquid is completely determined and is called the 
saturated vapor pressure. As we change the temperature of the system the vapor 
pressure will also change. The Clausius-Clapeyron equation tells us how the 
vapor pressure changes as a function of temperature along the coexistence 
curve. 

We can obtain a rather simple equation for the vaporization curve if we make 
some approximations. Let us assume that changes in the molar volume of the 
liquid may be neglected relative to changes in the molar volume of the vapor 
(gas) as we move along the coexistence curve, and let us assume the vapor 
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obeys the ideal gas law. Then A v^RT/P, and the Clausius-Clapeyron 
equation for the vapor pressure curve takes the form 


dP\ PAhi g 

df) m ~ RT 1 ’ 


(3.15) 


where A hi g is the latent heat of vaporization. If we assume that the latent heat 
of vaporization is roughly constant over the range of temperatures considered, 
we can integrate Eq. (3.15) to obtain 

P = P 0 e~ Ah,g/RT . (3.16) 


Thus, as the temperature is increased, the vapor pressure increases 
exponentially along the vaporization curve. Conversely, if we increase the 
pressure, the temperature of coexistence (boiling point) increases. 


■ Exercise 3.2. Compute the molar heat capacity of a vapor along the 
vaporization curve. 


Answer: Along the vaporization curve there is only one independent 
variable, which we choose to be the temperature. In general, the entropy of 
the vapor is a function of both the pressure and temperature, but along the 
vaporization curve the pressure is related to the temperature by the Clausius- 
Clapeyron equation. The molar heat capacity along the vaporization curve 
can be written 



where we have used Eqs. (2.8) and (2.112). The molar heat capacity, cp, is 
the heat capacity of the vapor held at constant pressure as we approach the 
coexistence curve. If we use the ideal gas equation of state to describe the 
properties of the vapor phase and if we use the Clausius-Clapeyron equation, 
we obtain the following expression for the molar heat capacity along the 
coexistence curve. 


A hu 

Ccoex = Cp y • 


( 2 ) 


At low enough temperatures, it is possible for the heat capacity, c coex , to be 
negative. This would mean that if the temperature of the vapor is raised and 
it is maintained in equilibrium with liquid phase, the vapor would give off 
heat. 
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3.D.2.b. Fusion Curve 

The fusion curve does not terminate at a critical point but can have either 
positive or negative slope. In Fig. 3.4 we used a solid line for the fusion curve 
with a positive slope and a dashed line for the case of negative slope. The 
Clausius-Clapeyron equation for the liquid-solid transition is 


dP\ ^ Ah s i 
dT) coex TA Vs ,' 


(3.17) 


where A v s i is the change in molar volume in going from the solid to the liquid 
phase and A h s i is the latent heat of fusion. If the volume of the solid is greater 
than that of the liquid, then Av s / will be negative and the slope, ( dP/dT) coeK , 
will be negative. For the case of a fusion curve with positive slope, if we 
increase the pressure at a fixed temperature, we simply drive the system deeper 
into the solid phase. However, if the fusion curve has a negative slope, then 
increasing the pressure at fixed temperature can drive the system into the liquid 
phase. Water is an example of a system whose fusion curve has negative slope. 
The negative slope of the fusion curve for water makes ice skating possible. As 
the skate blades exert pressure on the ice, it turns to water and the skater floats 
along on a narrow puddle. 


3.D.2.C. Sublimation Curve [3] 

If a solid is placed in an evacuated chamber and maintained at some pressure 
and temperature along the sublimation curve, a vapor will coexist in 
equilibrium with the solid phase. The Clausius-Clapeyron equation for the 
sublimation curve is 


/ dP\ A h sg 

\df) ~ T Av., 

\ / coex 


(3.18) 


where Av sg is the change in molar volume in going from the solid to the gas 
phase and A h sg is the latent heat of sublimation. If we again assume that the gas 
phases obeys the ideal gas equation of state, the Clausius-Clapeyron equation 
takes the form 



coex 


PAh sg 
RT 2 


(3-19) 


If the vapor pressure is known over a small temperature interval, then the latent 
heat of sublimation can be obtained from Eq. (3.19). We can rewrite Eq. (3.19) 
in the form 

(3.20) 

Then A h sg is proportional to the slope of the curve, ln(P) versus 1 /T. 


Ah r dla{p) 

Ahsg ~ R d(l/T)‘ 
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| EXERCISE 3.3. In the neighborhood of the triple point of ammonia 
(NH 3 ), the equation for the sublimation curve is ln(7) = 27.79 — 3726 /T 
and the equation for the vaporization curve is ln(7) = 24.10 — 3005/7, 
where 7 is measured in pascals and T is measured in kelvins. (a) Compute 
the temperature and pressure of the triple point, (b) What is the latent heat of 
sublimation? What is the latent heat of vaporization?. 


Answer: 

i 

(a) At the triple point, the pressure and temperature of the vapor, liquid, 
and solid are the same. Therefore, the equation for the triple point 
temperature, T t , is 27.79 — 3726/7, = 24.10 — 3005/7, or 7, = 
195.4 K. The triple point pressure, P t , is P t = 6.13kPa. 

. (b) The slope of the sublimation curve is 


fdP\ _ 3726 P PAh sg 

W) coe r^ x ~^' 


(i) 


Therefore, A h sg « 31 kJ/mol. The slope of the vaporization curve is 

fdP\ 30057 7Afi /g 

T 2 ~ RT 2 1 > 

Therefore, A hi g « 25 kJ/mol . 


At moderately low temperatures (but not low enough for quantum effects to 
cause significant deviations from the classical gas equation of state), the vapor 
pressure along the sublimation curve is very low. We can use this fact to obtain 
a fairly simple equation for the sublimation curve which includes temperature 
variations in the latent heat and heat capacity of the vapor and solid. Let us 
first note that infinitesimal changes in the molar enthalpy of the solid can be 
written 


dh = Tds + vdP = c P dT + v(l - Ta P )dP, (3.21) 

where a P is the thermal expansivity (cf. Eqs. (2.155), (2.1 12), (2.144)). We can 
use Eq. (3.21) to find an approximate expression for the difference between the 
enthalpy of the solid at two different points along the sublimation curve. We 
restrict ourselves to regions where the vapor pressure along the sublimation 
curve is small and we neglect pressure variations in Eq. (3.21). Note that 
dh g — dh s = ( c s p — c g P )dT along the coexistence curve. We then can integrate 
Eq- (3.21) and write 


A h sg = A h sg + 


To 


(4 


>)d7, 


(3.22) 
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where A h sg = h g — h s is the latent heat of sublimation at temperature T, 
A h® g = h g — h® is the latent heat of sublimation at temperature To, h s and h g (h® 
and h° g ) are the enthalpies of the solid and gas phases, respectively, at 
temperature T (To) and very low pressure, and Cp and c s p are the moler heat 
capacities of the gas and solid, respectively. If we now integrate the Clausius- 
Clapeyron equation, (3.21), using Eq. (3.22) for A h sg , we obtain the following 
equation for the sublimation curve: 



Equation (3.23) can be useful for extending the results of experimental 
measurements [3]. 

3.D.3. Liquid- Vapor Coexistence Region [4, 5] 

The liquid-vapor coexistence region culminates in a critical point and will be of 
special interest later. Therefore, it is useful at this point to examine the 
coexistence region under the critical point more closely. Let us redraw the 
coexistence curve (vaporization curve) for the liquid-vapor transition in the P-v 
plane (cf. Fig. 3.7). The isotherms for stable thermodynamic states are indicated 
by the solid lines. As we decrease the molar volume of the gas with temperature 
fixed at To <T C , the pressure increases until we reach the coexistence curve 
(point A). At point A, the vapor starts to condense and the pressure remains 
fixed until all vapor has changed to liquid (point B). Then the pressure begins to 
rise again. 



VB Vc VD V A V 


Fig. 3.7. The coexistence curve for the vapro-liquid coexistence region for a pure PVT 
system. 
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The amounts of liquid and vapor which coexist are given by the lever rule. 
Let us consider a system with temperature Tq < T c , pressure Pq, and total molar 
volume vd- The system is then in a state in which the liquid and vapor phases 
coexist. The total molar volume, v D , is given in terms of the molar volume of 
the liquid at point B, v/, and the molar volume of vapor at point A, v g , by 

vd — xm + x g v g , (3.24) 

where xi is the mole fraction of liquid at point D and x g is the mole fraction of 
gas at point D. If we multiply Eq. (3.24) by xi+x g = 1, we find 


(3.25) 

Xg (vd - V/) 

Equation (3.25) is called the lever rule. It tells us that the ratio of the mole 
fractions of liquid to gas at point D is equal to the inverse ratio of the distance 
between point D and points A and B. 

As long as (dv/dP) T < 0, the system is mechanically stable (cf. Eq. 
(2.179)). If we continue the isotherm, Tq, past the points A and B (the dashed 
line), we obtain curves which are mechanically stable but no longer correspond 
to a minimum of free energy. States along the dashed line at point A correspond 
to supercooled vapor states, while those along the dashed line at point B 
correspond to superheated liquid states. Such states are metastable and can be 
produced in the laboratory for very pure samples. It is possible that the 
superheated liquid curve can extend into the region of negative pressure. Such 
states can also be realized in the laboratory but require walls to maintain them. 
Systems with negative pressure are under tension and pull in on walls rather 
than push out against them. As we approach the critical temperature, the region 
of metastable states becomes smaller, and at the critical temperature it 
disappears. Thus, no metastable states can exist at the critical temperature. 
Also, as we approach the critical temperature the molar volumes of the liquid 
and vapor phases approach one another, and at the critical temperature they 
become equal. 

The actual shape of the coexistence curve in the T-p plane (p is the mass 
density) has been given by Guggenheim [6] for a variety of pure substances and 
is reproduced in Fig. 3.8. Guggenheim plots the coexistence curve in terms of 
the reduced quantitites T/T c and p/p c , where T c and p c are the critical 
temperature and density, respectively, of a given substance. The reduced 
quantities T/T c and p/p c give a measure the distance of the particular substance 
from its critical point (T c and p c are different for each substance). Most 
substances, when plotted in terms of reduced temperature and density, lie in 
approximately the same curve. This is an example of the so-called law of 
corresponding states, which says that all pure classical fluids, when described 
in terms of reduced quantities, obey the same equation of state [7]. We shall see 
this again when we return to the van der Waals equation. The reduced densities 
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Fig. 3.8. Experimental vapor-liquid coexistence curve for a variety of substances. The 
plot is of the reduced temperature versus reduced density. (Based on Ref. 6.) 


of the liquid and gas phases along the coexistence curves obey the following 
equations [6]: 


and 


Pi + Pg = , 3 

2 Pc 4 



Pi- Pg = 7 
Pc 2 



(3.26) 

(3.27) 


These equations will be useful later. 

It is possible to obtain expressions for response functions in the coexistence 
region. As an example, we will consider the molar heat capacity, c v , for a liquid 
and vapor coexisting at a fixed molar volume, vq (cf. Fig. 3.7). If we neglect any 
effects of gravity, then the system will consist of droplets of liquid in 
equilibrium with and floating in vapor. The internal energy per mole of the 
liquid at point D is m/(v#, T 0 ) and that of the vapor at point D is u g (v A , To) (the 
thermodynamic properties of the liquid and the vapor at point D are the same as 
on their respective sides of the coexistence curve). The total internal energy at 
point D is 


f/tot = Wg«(v g , To) + n t u(vi, To) 


(3.28) 
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where v g = va , v / = v B , and n g and n/ are the number of moles of gas and liquid, 
respectively, at point D. The total internal energy per mole at point D is 

«tot = X g u(v g , To) + X t u(v h To). (3.29) 


Let us now look at the variation of the internal energy with temperature along a 
line of fixed molar volume at point D (the molar heat capacity at point D), 


C v 




~\~Xi 

coex 



+ («/ — Ug) 

coex 



(3.30) 


where we have used the fact that dxi = —dx g . Equation (3.30) can be expressed 
in terms of directly measurable quantities. There are several steps involved 
which we itemize below. 


(i) First consider the quantity {dui / dT) coex . From Eq. (2.8), we can write 
fdu[\ _ (du{\ , (du{\ (dvi\ _ fduA /dvA 

w Lr w, WrlwL"’' Wtwj 

(3.31) 


where c Vl is the molar heat capacity of the liquid at point B. Similarly, 




(3.32) 


where c Vg is the molar heat capacity of the vapor at point A. 

(ii) Next consider the difference A u = u g — ui between the molar internal 
energies of the gas and liquid. From the Clausius-Clapeyron equation 
(3.14) and the fundamental equation for the enthalpy, Eq. (2.83), we can 
write 


\dT J coex TAv TAv T Av T Av T v ; 

where A h — h g — hi and Av = - v/ (AP = 0 because the pressure of 

the two coexisting phases are the same). Therefore, 


A u = u g — m 


r (i) 


(3.34) 


(iii) Finally, let us consider the quantity ( dxi/dT) coex . Since the total molar 
volume at point D can be written v# = x g v g + jc/v/, we can write 




(3.35) 
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Here we have used the fact that as we vary the temperature along the line 
vd =constant, the liquid and vapor vary along their respective sides of 
the coexistence curve. We can rewrite Eq. (3.35) in the form 




(3.36) 


We can now combine Eqs. (3.30), (3.32), (3.34), and (3.36) to obtain the 
following expression for the heat capacity along the line v D ^constant; 


Cy Xg^Vg "i~ Xg 

\ dT 


du g \ fdv g \ ( duA / 

dv g ) T \dT ) C0€X +X ‘ \dv,J T \ 


dvi 

dT 


dT) t 


dT 


(3.37) 


We now can make two final changes to Eq. (3.37). We can use the identity 


(9u g \ =T (dP g \ 
\dvgJ T \ dT ) Vg 


-P 


(3.38) 


and an analogous one for ( dui/dvi) T [cf. Eq. (2.145)]. Also, we can use the 
identity (2.8) to write 


(9P\ = {9P\ _(9P\ 

\dTJv, \dT) coex \dvJ T \dT) 


(3.39) 


and an analogous expression for ( dP/dT ) vi . If Eqs. (3.38) and (3.39) and 
the analogous expressions for the liquid are substituted into Eq. (3.37), we 
find 


c v 


— Xn 



+ Xi 


C VI 




2 1 


coex 


. (3.40) 


All quantities in Eq. (3.40) are measurable, and therefore a numerical value for 
the heat capacity can be obtained without much difficulty. Equation (3.40) will 
be useful later when we consider critical exponents. 

The heat capacity at constant volume is finite in the coexistence region. 
However, the heat capacity at constant pressure is infinite in the coexistence 
region. If we add heat to a system with coexisting liquid and vapor phases 
and keep the pressure fixed, liquid will turn to vapor but the temperature 
will not change. Thus, cp = o c in the coexistence region, while c v can remain 
finite. 
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3.D.4. The van der Waals Equation 

The van der Waals equation was first derived by van der Waals in his doctoral 
dissertation in 1873. It was the first, and to this day is the simplest, equation of 
state which exhibits many of the essential features of the liquid-vapor phase 
transition. The van der Waals equation is cubic in the molar volume and can be 
written in the form 


v 


3 



a 

+ _ v _ 



(3.41) 


An isotherm of the van der Waals equation is plotted in Fig. 3.9. For small 
values of 7 and P, the cubic equation has three distinct real roots (three values of 
v) for each value of P and T (this case is shown in Fig. 3.9). As T increases, the 
roots coalesce at a critical temperature, T c , and above T c two of the roots 
become imaginary and therefore unphysical. As T — > oo, Eq. (3.41) reduces to 
the ideal gas equation of state, v = RT / P. The critical point is the point at 
which the roots of Eq. (3.41) coalesce. It is also the point at which the critical 
isotherm {T = T C ) has a vanishing slope (dP/dv) T=Tc — 0 and an inflection 
point {d 2 P/dv 2 ) T=Tc = 0. [An inflection point is a point where the curve 
changes from convex to concave and ( d 2 P/dv 2 ) T changes sign.] If one uses the 
fact that 



and 



(3.42) 



3.9. A sketch of a typical van der Waals isotherm. The line from D to F 
corresponds to mechanical unstable states. The area, CDE, is labeled 2, and the area 
EFG is labeled 1. The area under the curve, v = v(P), between any two points, is equal 
to the difference in molar Gibbs free energy between the points. 
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at the critical point, then we obtain the following values for the temperature T c , 
pressure P c , and molar volume v c , at the critical point: 


Pc 


a 

27 &2’ 


v c = 3 h, 


8 a 

21 bR 


(3.43) 


If we introduce reduced variables P = P/P c , T = T/T c , and v = v/v c , then we 
may write the van der Waals equation in the form 


P + =2 ) (3v — 1) = 87\ 


(3.44) 


It is important to note that Eq. (3.44) is independent of a and b. We are now 
measuring pressure, volume, and temperature in terms of their distance from the 
critical point. The values of v c , T c , and P c will differ for different gases, but all 
gases obey the same equation if they are the same distance from their respective 
critical points — that is, if they have the same values of P = P/P c , T = T/T C , 
and v = v/v c . Thus, we see again the law of corresponding states. 

An unphysical aspect of the van der Waals equation is its prediction of 
positive slope, ( dP/dv ) T , for certain segments of the isotherms below T c (the 
segment between D and F in Fig. (3.9). This region corresponds to 
mechanically unstable thermodynamic states. However, the unphysical parts 
of the P-V curve can be removed by use of the so-called Maxwell construction, 
which we will now describe. 

From Eq. (2.108), we can write the equation for infinitesimal changes in the 
molar Gibbs free energy in the form 

dg = -sdT + vdP. (3.45) 

If we now restrict ourselves to one of the van der Waals isotherms so dT = 0, 
we can determine how g varies with pressure along that isotherm. In Fig. 3.9 we 
plot the molar volume as a function of pressure along a typical van der Waals 
isotherm, and in Fig. 3.10 we plot the molar Gibbs free energy as a function of 
pressure for the isotherm in Fig. 3.9. Along the isotherm the difference in molar 
Gibbs free energy between any two points is equal to the area under the curve, 
v = v(P), between those two points: 


8 2 ~ g 1 



(3.46) 


The Gibbs free energy increases and is concave between A and D. Between D 
and F it decreases and is convex. Then between F and I it becomes concave 
again and increases. We see that between D and F the states are mechanically 
unstable since mechanical stability requires that g be concave (cf. Section 
2.H.3). The regions from A to D and from F to I are both mechanically stable 
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isotherm in Fig. 3.9. 


pressure for the 


since g is concave. However, only the curve ACI in Fig. 3.10 corresponds to 
states in thermodynamic equilibrium because for these states the Gibbs free 
energy is a minimum. The states FCD are metastable. The equilibrium states 
thus correspond to those states whose Gibbs free enegy has values lying along 
the curve ACI. To obtain the equilibrium states on the isotherm between C and 
G we must draw a straight line (line of constant pressure) between them, since 
this is the only way the Gibbs free energy will remain constant in going from C 
to G. The physical isotherm (isotherm containing equilibrium states) is the line 
ABCEGHI in Fig. (3.9). 

Before we can complete our construction of isotherms, we must decide 
where C and G lie. For the points C and G, the molar Gibbs free energies are 
equal. Thus 


0 = 


'Pc 

Pc 


v(P) dP 


'Pd cPe 

v(P)dP+\ v(P)dP 

Pc JPd 


+ 


'Pf fPg 

v{P) dP + v(P) dP 
Jp E Jp F 


(3.47) 


or, after rearranging, 




v(P) dP 


cPe 


v{P) dP 

Jp F 


rPc 

v(P) dP. 


Pf 


(3.48) 


The left-hand side is equal to area 2 in Fig. 3.9 and the right-hand side is equal 
to area 1. Thus, the line from C to G must be drawn so that the areas 1 and 2 are 
equal: 


Area 1 = Area 2. 


(3.49) 
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If this is done, the curve ACEGI then gives the equilibrium states of the system. 
The condition given in Eq. (3.49) is called the Maxwell construction. Thus, with 
the Maxwell construction, we obtain the equilibrium isotherms from the van der 
Waals equation and the curves for metastable states. 


3.E. SUPERCONDUCTORS [8-10] 

Superconductivity was first observed in 1911 by Kamerlingh Onnes. He found 
that the resistance to current flow in mercury drops to zero at about 4.2 K (cf. 
Fig. 3.11). At first this was interpreted as a transition to a state with infinite 
conductivity. However, infinite conductivity imposes certain conditions on the 
magnetic field which were not subsequently observed. The relation between the 
electric current, J, and the applied electric field, E, in a metal is given by Ohm’s 
law, 


J - &E, (3.50) 

where o is the conductivity. The electric field E is related to the magnetic field 
B by Faraday’s law, 


V,xE = - f (3.51) 

If we substitute Eq. (3.50) into Eq. (3.51), we see that for infinite conductivity 
a — > oo and dYl/dt — 0. This in turn implies that the state of the system 
depends on its history. If we first cool the sample below the transition 
temperature and then apply an external magnetic field, H, surface currents must 
be created in the sample to keep any field from entering the sample, since B 
must remain zero inside (cf. Fig. 3.12). However, if we place the sample in the 
H-field before cooling, a B-field is created inside. Then, if we cool the sample, 
the B-field must stay inside. Thus, the final states depend on how we prepare the 
sample. With the hypothesis of infinite conductivity, the state below the 


R (ohms) 



Fig. 3.11. The resistance of mercury 
drops to zero at about 4.2 K. (Based on 
Ref. 10.) 
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Fig. 3.12. A superconducting sample has different behavior if it is a perfect conductor 
or perfect diamagnet (the circle is the sample). 


transition temperature cannot be a thermodynamic state since it depends on 
history. 

In 1933, Meissner and Ochsenfeld [11] cooled a monocrystal of tin in a 
magnetic field and found that the field inside the sample was expelled below the 
transition point for tin. This is contrary to what is expected if the transition is to 
a state with infinite conductivity; it instead implies a transition to a state of 
perfect diamagnetism, B = 0. It is now known that superconductors are perfect 
diamagnets. When superconducting metals are cooled below their transition 
point in the presence of a magnetic field, currents are set up on the surface of 
the sample in such a way that the magnetic fields created by the currents cancel 
any magnetic fields initially inside the medium. Thus B = 0 inside a 
superconducting sample regardless of the history of its preparation. 

No electric field is necessary to cause a current to flow in a superconductor. 
A magnetic field is sufficient. In a normal conductor, an electric field causes 
electrons to move at a constant average velocity because interaction with lattice 
impurities acts as a friction which removes energy from the electron current. In 
a superconductor, an electric field accelerates part of the electrons in the metal. 
No significant frictional effects act to slow them down. This behavior is 
reminiscent of the frictionless superflow observed in liquid He 4 below 2.19 K 
(cf. Section S3.B). Indeed, the superfluid flow in He 4 and the supercurrents in 
superconductors are related phenomena. The origin of the apparently 
frictionless flow in both cases lies in quantum mechanics. It is now known 
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that electrons in a superconducting metal can experience an effective attractive 
interaction due to interaction with lattice phonons. Because of this attraction, a 
fraction of the electrons (we never know which ones) can form “bound pairs.” 
The state of minimum free energy is the one in which the bound pairs all have 
ths same quantum numbers. Thus, the bound pairs form a single macro- 
scopically occupied quantum state which acts coherently and forms the 
condensed phase. As we shall see, the order parameter of the condensed phase 
behaves like an effective wave function of the pairs. Because the pairs in the 
condensed phase act coherently (as one state), any friction effects due to lattice 
impurities must act on the entire phase (which will contain many pairs and have 
a large mass) and not on a single pair. Thus, when an electric field is applied, 
the condensed phase moves as a whole and is not slowed significantly by 
frictional effects. 

The condensed phase flows at a steady rate when a magnetic field is present, 
and it is accelerated when an electric field is applied. Just below the transition 
temperature, only a small fraction of electrons in the metal are condensed and 
participate in superflow. As temperature is lowered, thermal effects which tend 
to destroy the condensed phase become less important and a larger fraction of 
the electrons condense. 

If a superconductor is placed in a large enough external magnetic field, the 
superconducting state can be destroyed. A plot of magnetic induction, B, versus 
applied field, H, appears as in Fig. 3.13. For applied field, H, with a value less 
than some temperature-dependent critical value, H coex {T), the system is a 
perfect diamagnet. That is, the permeability fi = 0 and therefore B = 0. 
However, for H > H coex (T ) the system becomes normal and B = fiH. (For 
normal metals /i « /x 0 , where no is the permeability of the vacuum.) Thus, 


f 0 if H< H coex (T ) 

1 imH if H>H coex {T) 


(3.52) 


B 


MoH 



Fig. 3.13. A plot of the magnetic induction, B, versus the applied magnetic field, H, in a 
superconductor. 
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H 



Fig. 3.14. The coexistence curve for normal and superconducting states. 


The field, H coex (T) lies on the coexistence curve for the two phases. It has been 
measured as a function of the temperature and has roughly the same behaviour 
for most metals (cf. Fig. 3.14). The coexistence curve for the normal and 
superconducting phases is well approximated by the equation 

*Wr) = flb(l-|j), (3.53) 

where T c is the critical temperature when no external fields are present. The 
slope {dH/dT) coex ~ 0 at T = 0K and is negative at T = T c . The phase 
diagram for a superconducting metal has analogies to the vapor-liquid 
transition in a PVT system, if we let H coex replace the specific volume. Inside 
the coexistence curve, condensate begins to appear. 

Along the coexistence curve, the chemical potentials of the superconducting 
and normal phases must be equal and, therefore, any changes in the chemical 
potentials must be equal. Thus, along the coexistence curve 


-s n dT - B n dH = —s s dT - B s dH (3.54) 


or 




IA)Hcoex{T) 



(3.55) 


Equation (3.55) is the Clausius-Clapeyron equation for superconductors. We 
have used the fact that B s = 0 and B n = iM)H coex (T ) on the coexistence curve. 
Eere s n ^ is the entropy per unit volume of the normal (superconducting) phase. 
We see that the transition has a latent heat (is first order) for all temperatures 
except T — T c where H coex = 0. When no external magnetic fields are present, 
the transition is continuous. 
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The change in the heat capacity per unit volume at the transition is 


(Oi g) 


coex 


,d(s n - S s ) 


dT 






(3.56) 


We have used Eq. (3.53) to evaluate the derivatives ( dH/dT) coex . At low 
temperatures the heat capacity of the normal phase is higher than that of the 
superconducting phase. At T — T c (the critical point) the heat capacity is higher 
in the superconductor and has a finite jump, (c 5 — c n ) T=T = (4^o/T c )Hq. It is 
worthwhile noting that as T — > 0, (s s - s n ) — * 0 since ( dH/dT) coex — * 0 as 
T —> 0. This is in agreement with the third law of thermodynamics. 

It is useful to obtain the difference between the Gibbs free energies of the 
normal and superconducting phases for H = 0. The differential of the Gibbs 
free energy per unit volume, g, is 


dg — —sdT — BdH. 


(3.57) 


If we integrate Eq. (3.57) at a fixed temperature, we can write 


g(T,H) - g(T,0) = - 


* H 

BdH. 


o 


For the normal phase we have 

For the superconducting phase (H < H coex ), we have 


& (7\ff)-s,(7\0) = 0 


(3.58) 


(3.59) 


(3.60) 


since B = 0 inside the superconductor. If we next use the fact that 


gn{T, H coex ) = gs(T, Hcoex), (3.61) 

we obtain the desired result 

g.(T,0) = g„(T,0)-^til a (T). (3.62) 

Thus, the Gibbs free energy per unit volume of the superconductor in the 
absence of a magnetic field is lower than that of the normal phase by a 
factor fio/2 H^ oex (T) (the so-called condensation energy) at a temperature T. 
Since the Gibbs free energy must be a minimum for fixed H and T, the 
condensed phase is a physically realized state. Note that g s (T c , 0) = g n {T c , 0) as 
it should. 
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3.F. THE HELIUM LIQUIDS 

From the standpoint of statistical physics, helium has proven to be one of the 
most unique and interesting elements in nature. Because of its small atomic 
mass and weak attractive interaction, helium remains in the liquid state for a 
wide range of pressures and for temperatures down to the lowest measured 
values. 

The helium atom occurs in nature in two stable isotopic forms, He 3 and He 4 . 
He 3 , with nuclear spin (1/2), obeys Fermi-Dirac statistics; while He 4 , with 
nuclear spin 0, obeys Bose-Einstein statistics. At very low temperatures, where 
quantum effects become important, He 3 and He 4 provide two of the few 
examples in nature of quantum liquids. 

Chemically, He 3 and He 4 are virtually identical. The only difference between 
them is a difference in mass. However, at low temperatures the two systems 
exhibit very different behavior due to the difference in their statistics. Liquid 
He 4 , which is a boson liquid, exhibits a rather straightforward transition to a 
superfluid state at 2.19 K. This can be understood as a condensation of particles 
into a single quantum state. Liquid He 3 also undergoes a transition to a 
superfluid state, but at a much lower temperature (2.7 x 10 3 K). The 
mechanism for the superfluid transition in liquid He 3 is quite different from 
that of liquid He 4 . In liquid He 3 , particles (more accurately, quasiparticles) form 
bound pairs with a spin s = 1 and relative angular momentum, / = 1 . The 
mechanism is similar to the formation of bound pairs in a superconductor 
except that the pairs in a superconductor are formed with spin s = 0 and angular 
momentum l = 0. Thus, the bound pairs in a superconductor are spherical with 
no magnetic moment, while those in liquid He 3 are flatter along one axis, carry 
angular momentum, and have a net magnetic moment. The fact that the bound 
pairs in liquid He 3 have structure leads to many fascinating effects never before 
observed in any other physical system. 

When He 3 and He 4 are combined to form a binary mixture, a new type of 
phase point occurs (called a tricritical point) in which a A-line connects to the 
critical point of a binary phase transition. 

While we cannot discuss the theory of these systems at this point, it is worth- 
while to look at their phase diagrams since they present such a contrast to those 
of classical fluids and they tend to confirm the third law. 


3.F.I. Liquid He 4 [12-14] 

He 4 was first liquefied in 1908 by Kamerlingh Onnes at a temperature of 
4.215 K at a pressure of 1 atm. Unlike the classical liquids we described in 
Section 3.D, it has two triple points. The coexistence curves for liquid He 4 are 
shown in Fig. 3.15 (compare them with the coexistence curve for a classical 
liquid in Fig. 3.4). He 4 at low temperature has four phases. The solid phase only 
a ppears for pressures above 25 atm, and the transition between the liquid and 
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P (atm) 



Fig. 3.15. The coexistence curves for He 4 . 


solid phases is first order. The liquid phase continues down to temperatures 
approaching T = 0 K. However, there are in fact two liquid phases. As the 
normal liquid [liquid He(I)] is cooled, a line of A-points occurs at about 
T — 2 K (the exact temperature depends on the pressure), indicating that a 
continuous symmetry-breaking phase transition occurs at this line. There is a 
triple point at each end of the — line. The symmetry that is broken is gauge 
symmetry. Below the A-line, the liquid phase (which was called liquid He(II) by 
Keesom and Wolfke [15]) begins to exhibit very strange properties. The first 
experimenters who worked with liquid He 4 found that it was able to leak out of 
their containers through cracks so tiny that even He 4 gas could not leak through. 
This apparently frictionless flow is a consequence of the fact that the condensed 
phase is a highly coherent macroscopic quantum state. It is analogous to the 
apparent frictionless flow of the condensed phase in superconductors. The order 
parameter for the condensed phase in liquid He 4 is a macroscopic “wave 
function,” and the Ginzburg-Landau theory for the condensed phase of He 4 is 
very similar to that of the condensed phase in superconductors, except that in 
liquid He 4 the particles are not charged. The specific heat of liquid He 4 along 
the A-line is shown in Fig. 3.16. We can see that it has the lambda shape 
characteristic of a continuous phase transition. 

The phase diagram of He 4 provides a good example of the third law. The 
vapor-liquid and solid-liquid coexistence curves approach the P-axis with zero 
slope. From Eqs. (2.54) and (2.100), we see that this is a consequence of the 
third law. 

3.F.2. Liquid He 3 [16-18] 

The He 3 atom is the rarer of the two helium isotopes. Its relative abundance in 
natural helium gas is one part in a million. Therefore, in order to obtain it in 
large quantities, it must be “grown” artificially from tritium solutions through 
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Fig. 3.16. The specific heat of He 4 at vapor pressure at the A-point [12]. 


/3-decay of the tritium atom. Thus, He 3 was not obtainable in large enough 
quantities to study until the late 1940s and it was first liquefied in 1948 by 
Sydoriack, Grilly, and Hammel [19]. Since the He 3 atom has only (3/4) the 
mass of a He 4 atom, it has a larger zero point energy than the He 4 atom. As a 
result, He 3 boils at temperature about 25% lower than He 4 , and it requires a 
pressure about 25% greater than that of He 4 to solidify. 

The phase diagram for He 3 (on the same scale as that for He 4 ) is given in 
Fig. 3.17. On this scale there appears to be no transition to a superfluid state. 
There is, however, a minimum in the liquid-solid coexistence curve. This is 
attributed to the spin of the He 3 atom. At low temperature the spin lattice of the 
He 3 solid has a higher entropy than the liquid. The entropy difference, 


P (atm) 



Fig. 3.17. Coexistence curves for He 3 [16]. 
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P (atm) 



Fig. 3.18. Coexistence curves for superfluid phases of He 3 when no magnetic field is 
applied [16]. 


AS = liquid — iSsoiid, is positive at high temperature, vanishes at about 
T — 0.3 K and becomes negative below 0.3 K. Since volume differences 
remain virtually unchanged, the Clausius-Clapeyron equation dP/dT = 
AS/AV leads to a positive slope at high temperature and a negative slope at 
low temperature. At low temperature, if the third law is to be satisfied, the slope 
of the liquid-solid coexistence curve must become flat as T — ► 0 K. 

Superfluidity was first observed in liquid He 3 in 1971 by Osheroff, 
Richardson, and Lee [20]. In 1997, they received the Nobel prize for this 
work. The transition occurs at 2.7 x 10 -3 K at a pressure of about 34 atm. The 
phase diagram for a small temperature interval is shown in Fig. 3.18. There are, 
in fact, several superfluid phases in liquid He 3 , depending on how the bound 
pairs orient themselves. The so-called A-phase is an anisotropic phase. The 
bound pairs (which have shape) all orient on the average in the same direction. 
This defines a unique axis in the fluid and all macroscopic properties depend on 
their orientation with respect to that axis. The B-phase is a more isotropic phase 
and has many features in common with the superfluid phase of a super- 
conductor. If we apply a magnetic field to liquid He 3 , a third superfluid phase 
appears. The transition between the normal and superfluid phases appears to be 
continuous, while that between the A and B superfluid phases appears to be first 
order. 

3.F.3. Liquid He 3 -He 4 Mixtures 

When He 3 and He 4 are mixed together and condensed to the liquid state, some 
interesting phenomena occur. We will let jc 3 denote the mole fraction of He 3 . In 
1949, Abraham, Weinstock, and Osborne [21] showed that He 3 -He 4 mixtures 
can undergo a transition to a superfluid state. In this early experiment, they 
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X3 


Fig. 3.19. The phase diagram for a liquid He 3 -He 4 mixture plotted as a function of 
temperature T and mole fraction x 3 of He 3 . 


found that the A-line extended from T = 2. 19 K for x 3 = 0 to about T — 1 .56 K 
for jc 3 = 0.282. Later experiments extended the A-line down to T — 0.87 K for 
Xj = 0.67 (cf. Fig. 3.19). 

In 1954, Prigogine, Bingen, and Bellemans [22] predicted the existence of a 
phase separation of liquid He 3 -He 4 mixtures into an He 3 -rich phase and an He 4 - 
rich phase. (The theory of binary phase separation in classical mixtures is 
discussed in Sect. (S3.E).) This phase separation was found in 1956 by Walters 
and Fairbank [23] using nuclear magnetic resonance techniques. They were 
able to measure the magnetic susceptibility of the liquid in different vertical 
layers of the mixture. In the coexistence region the magnetic susceptibility, 
which is a measure of the concentration of He 3 atoms varies with height. The 
critical point for this binary phase transition lies at the end of the A-line at 
T = 0.87 K and x 3 = 0.67. The phase transition along the A-line is second 
order. The binary phase separation is a first-order phase transition. In 1982, the 
region of metastable states for this first-order phase transition was measured by 
Alpem, Benda, and Leiderer [24]. 

The end point of the A-line is the critical point of the first-order binary 
phase transition. It was first called a tricritical point by Griffiths [25]. He 
pointed out that in a suitable space it is meeting point of three lines of 
second-order phase transitions. One line is the A-line. The other two are lines of 
critical points associated with the first-order phase transition. Thus, the 
tricritical point is different from the triple points we have seen in classical 
fluids. 
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3.G. GINZBURG-LANDAU THEORY [26] 

In the late 1930s, Ginzburg and Landau proposed a mean field theory of 
continuous phase transitions which relates the order parameter to the underlying 
symmetries of the system. One of the features which distinguishes first-order 
and continuous phase transitions is the behavior of the order parameter at the 
transition point. In a first-order phase transition, the order parameter changes 
discontinuously as one crosses the coexistence curve (except at the critical 
point). Also, first-order phase transitions may or may not involve the breaking 
of a symmetry of the system. For example, in the liquid-solid and vapor-solid 
transitions, the translational symmetry of the high-temperature phase (liquid or 
vapor) is broken, but for the vapor-liquid transition no symmetry of the system 
is broken. In the liquid and gas phases, the average particle density is 
independent of position and therefore is invariant under all elements of the 
translation group. The solid phase, however, has a periodic average density and 
is translationally invariant only with respect to a subgroup of the translation 
group. Solids may also exhibit first-order phase transitions in which the lattice 
structure undergoes a sudden rearrangement from one symmetry to another and 
the state of the solid changes discontinuously. 

At a first-order phase transition, the slope of the free energy curve changes 
discontinuously as a function of Y and T at the transition point and a symmetry 
may or may not be broken. At a continuous phase transition, the slope of the 
free energy curve changes continuously and a symmetry is always broken. In 
such transitions, a new macroscopic parameter (the order parameter) appears in 
the less symmetric phase. The order parameter may be a scalar, a vector, a 
tensor, a complex number or some other quantity. The form of the order 
parameter is determined by the type of symmetry that is broken (an extensive 
discussion of this point may be found in Ref. 26). For example, in the transition 
from a paramagnetic to a ferromagnetic system, rotational symmetry is broken 
because a spontaneous magnetization occurs which defines a unique direction in 
space. The order parameter is a vector. In the transition from normal liquid He 4 
to superfluid liquid He 4 , gauge symmetry is broken. The order parameter is a 
complex scalar. In a solid, the lattice might begin to undergo a gradual 
reorientation as the temperature is lowered. The order parameter is the change 
in the spatially varying number density. In continuous transitions, one phase 
will always have a lower symmetry than the other. Usually the lower 
temperature phase is less symmetric, but this need not always be the case. 

All transitions which involve a broken symmetry and a continuous change in 
the slope of the free energy curve can be described within the framework of a 
mean field theory due to Ginzburg and Landau [26]. Ginzburg-Landau theory 
does not describe all features of continuous phase transitions correctly, but it 
does give us a good starting point for understanding such transitions. Near a 
continuous phase transition the free energy is an analytic function of the order 
parameter associated with the less symmetric phase. In real systems, the free 
energy is not an analytic function of the order parameter near the critical 
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point. Nevertheless, Ginzburg-Landau theory still describes enough qualitative 
features of continuous phase transitions that it is well worth studying. 

We will let 77 denote the order parameter and let 0 denote the free energy and 
write 

4>(T, YJ) = MT, Y) + <xi(T, Y)rf + a } (T, Y)rf + a 4 (T, Y)rf + ■■■ -ft), 

(3.63) 

where yis a generalized thermodynamic force and /is the force conjugate to the 
order parameter (one can usually be found, even though it may only be a 
mathematical construct and not realizable in the laboratory). There is no term in 
Eq. (3.63) which is first order in 77 because that would ensure a nonzero value 
for the order parameter above the transition point. The molar free energy, 
</> 0 (r, y), describes the thermodynamic quantities not directly involved in the 
transition and generally will depend on other state variables. For example, if we 
are studying the transition from the paramagnetic to the ferromagnetic state of a 
magnetic crystal, free energy 0 o(T, y) would contain all the nonmagnetic 
contributions to the thermodynamic state of the crystal. 

The actual value of the order parameter, 77 , which is realized in nature is the 
one that gives a minimum of the free energy. By plotting the free energy as a 
function of 77 for various choices of the coefficients a 2 , £*3, a 4 , and /, it is easy to 
see under what conditions various types of phase transition can occur. The types 
of terms which are present in the free energy are determined by the symmetry 
properties of a system and the fact that the free energy is a scalar quantity. If the 
order parameter is a vector or tensor, then only invariant combinations can 
appear in the free energy. For example, the order parameter for a magnetic 
crystal is the magnetization. The magnetization, which is a vector, changes sign 
under time reversal. However, the free energy must be invariant under time 
reversal, and therefore a cubic term in the magnetization is ruled out. There are 
a number of different types of phase transition which can be described by the 
free energy in Eq. (3.63). We shall now consider several of these. 

3.G.I. Continuous Phase Transitions 

A continuous phase transition can occur if the cubic term in Eq. (3.63) cannot 
appear and if no external field is present. The free energy then takes the form 

0(7, y, 77 ) = MT, Y ) + a 2 (T, Y)rf + a 4 (7\ y) 77 4 + • • • (3.64) 

The dependence of a 2 (T, Y ) on temperature is chosen so that for temperatures 
above and at the critical temperature the free energy will only be minimized for 
V = 0 , while below the critical temperature it will be minimized for I 77 I > 0 . 

In general, the free energy will be minimum if 
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Equations (3.65) give us conditions that must be satisfied for equilibrium states. 
The right-hand equation in (3.65) is the condition for stability (not all stable 
states are equilibrium states). In order to have global stability, we must have 

a 4 (T, Y ) > 0. (3.66) 

This will ensure that as we increase rj to very large values, the free energy will 
continue to increase. The critical point occurs when a 2 (T, Y) = 0. This happens 
at a temperature T = T C (Y). If the critical temperature is a function of another 
variable, Y, then there will be a line of critical points in the ( T , Y) plane. If we 
choose a 2 (T , Y) > 0 for T > T C (Y) and a 2 (T, Y) < 0 for T < T C (Y), then the 
free energy, <h, will have its minimum value at rj = 0 when T > T C {Y) and will 
have its minimum value for rj ^ 0 when T <T C (Y). Since the free energy must 
vary continuously through the transition point, at T = T C (Y) we must have 
a 2 (T c , Y) = 0. We can combine all this information if we write a 2 (T,Y), in the 
neighborhood of the transition point, in the form 

a 2 (T, Y) = ao(7\ Y)(T - T C (Y)), (3.67) 


where «o is a slowly varying function of T and Y. 

In Fig. 3.20, we sketch the free energy for three values of a 2 . In curve (A), 
the free energy has a minimum for rj = 0. Curve (B) shows the critical point. 
The free energy becomes flattened in the neighborhood of r] = 0. In curve (C), 
the free energy has minima at rj = ±rj 0 ^ 0. The system will randomly select 
one of these two nonzero values of the order parameter, rj, below the critical 
point. The region on curve (C) for which {d 2 (f>/drj 2 ) T Y < 0 corresponds to a 
region of unstable states. The free energy has extrema when 

(^ 77 ) = 2c * 27? + 4c * 47 ? 3 ~ 0 (3.68) 



Fig. 3.20. The behavior of the free energy, $ = ajr} 2 + a 4 rj 4 , for a continuous phase 
transition for 014 = 4.0 and (A) a 2 = 0.6, (B) a 2 = 0.0, and (C) a 2 = —0.6. In the 
figure, rjc = 2.7386. 
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and therefore when 


v=0 01 (3 - 69) 


When T > T c , the minimum occurs for r] = 0. When T < T c , the minimum 
occurs for r) = ±^(ao/2a 4 )(T c - T). Thus, below t he critic al temperature, the 
order parameter is nonzero and increases as \JT C — T. From the above 
discussion, the free energy takes the following form: 


<t>(T,Y,ri) = MT,Y) for T > T c , 

<kt, y, v) = Mr, y) - a ° ( 4 Q ~ — for T < 


(3.70) 


where we have suppressed the dependence of T c on Y and the dependence of cko 
and ot 4 on T and Y. 

The molar heat capacity is 


Cy = 



(3.71) 


If we neglect derivatives of « 0 and a 4 (we assume that they vary slowly with 
temperature), we find that the molar heat capacity has a finite jump at the 
critical point: 


cy(T-)-cy(T;) = g. (3.72) 

The jump in the heat capacity has the shape of a A, as shown in Fig. 3.21, and 
therefore the critical point for a continuous phase transition is sometimes called 
a A-point. 

The transition from normal to superfluid in liquid He 4 is an example of a 
continuous phase transition. The order parameter, 77 , is the macroscopic wave 
function, 'k, for the condensed phase and the generalized force, Y, is just the 
pressure, P (the force P is not conjugate to $0. The free energy can be written 

4>(T, P, tf) = MT, P) + «2|9f + a 4 |’P| 4 + ■ • • , (3.73) 

where a 2 (T,P) = ao(T,P)(T - T c ) and ao(T,P) and a 4 (T,P) are slowly 
varying functions of T and P. The order param eter, vp = 0, above the critical 
temperature, and ^ = e‘ e ^(ao/2a 4 )(T c — T) below the critical temperature. 
The phase factor, 0, can be choosen to be zero as long as no currents flow in the 
system. As we see in Fig. 3.15 there is in fact a line of continuous transition 
points in the (P, T) plane. In Fig. 3.16 we showed the behavior of the heat 
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Fig. 3.21. The jump in the heat capacity at the critical point (lambda point) as predicted 
by Landau theory. 


capacity as we passed through the line of critical points. As we see from the 
figure, there is a finite lambda-shaped jump in the heat capacity of the liquid. 

If we turn on an external force,/, which couples to the order parameter, then 
the continuous phase transtion is destroyed. In the presence of an external force, 
the free energy has the form 

Y,f) = 0(r, Y, ri) -f V = MT, Y) + + • • • -f V , (3.74) 

where <22 = ai 2 (T, Y ) and a 4 = a 4 (T, 7). The order parameter is nonzero for all 
temperatures. A plot of the free energy in the presence of a force is shown in 
Fig. 3.22 for the same parameters as Fig. 3.20. 



Fig. 3.22. The behavior of the free energy, cj) = a 2 r) 2 + a 4 rf —ft), for a continuous 
phase transition for a 4 = 4.0, / = 0.06, and (A) a 2 = 0.6, (B) a 2 = 0.0, and (C) 
ol 2 = —0.6. r)A,r)B, and rjc locate the minima of the curves. In the figure m = 
0.0485, t)b = 0.1554, and rjc = 0.2961. 
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From Eq. (3.74), we can obtain the susceptibility, x — ( drj/df) TY = 
-(cPcf)' /df 2 ) T Y . The equilibrium state is a solution of the equation 

m =2a 2l? + W-/ = 0. (3.75) 

If we take the derivative of Eq. (3.75) with respect to/and solve for (drj/df) T Y , 
we obtain 


(^/) T,r 2«2 + 12a 4 ^' * 3 ' 76 ^ 

In the limit, / — ► 0, r) = 0 for T > T c and rj = for T < T c . 

Therefore, in the limit / — ► 0 the susceptibility will be different above and 
below the critical point. We find 


r ( dr >\ 

x =n\ef) T = 

X = lim = 

f—>o \df J j Y 


1 _ 1 

2q!2 2q!o (T — T c ) 

1 1 

4c* 2 4ao(T c - T) 


for T > T c , 


for T < T c . 


(3.77) 


Note that the susceptibility diverges at the critical point. 

The transition from a paramagnetic to ferromagnetic system is one of the 
simplest examples of a continuous phase transition. A system which exhibits 
this behaviour is a magnetic solid, such as nickel, whose lattice sites contain 
atoms with a magnetic moment. The critical temperature is called the Curie 
temperature. Above the Curie temperature, the magnetic moments are oriented 
at random and there is no net magnetization. However, as the temperature is 
lowered, magnetic interaction energy between lattice sites becomes more 
important than randomizing thermal energy. Below the Curie temperature, the 
magnetic moments became ordered on the average and a spontaneous 
magnetization appears. The symmetry that is broken at the Curie point is 
rotation symmetry. Above the Curie point, the paramagnetic system is 
rotationally invariant, while below it the spontaneous magnetization selects a 
preferred direction in space. The order parameter for this continuous phase 
transition is the magnetization, M. The magnetization is a vector and changes 
sign under time reversal. The free energy is a scalar and does not change sign 
under time reversal. If a magnetic field, H, is applied to the system, the 
Ginzburg-Landau free energy can be written in the form 


<£(7\H) = cf)o(T ) - M H + a 2 M • M + q 4 (M • M) 2 + • • • , (3.78) 

where the coefficients 012 and a 4 have the same properties as described above. 
For the case when the applied field H equals 0, the magnetization M equals 0 
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Fig. 3.23. The specific heat of nickel in the neighborhood of the Curie point. The 
dashed line gives the Curie point [ 34 ]. (Based on Ref. 34 .) 


above the Curie temperature and M = ±\J(ao/2QL 4 )(f c — T))M below the 
Curie temperature. M is a unit vector which gives the direction of the 
magnetization vector. The actual direction of the magnetization vector, if 
H = 0, will be determined by random fluctuations or outside influences. The 
heat capacity at the Curie point exhibits the characteristic A-shaped peak. As an 
example, the A-point in nickel is shown in Fig. 3.23. 

3.G.2. First-Order Transitions 

If the order parameter is a scalar, then there is no reason to exclude a third-order 
term in the molar free energy, (f). Also, if the order parameter is a second-order 
tensor, there are ways to contract a product of three such tensors to yield a 
scalar and again the free energy can have a third-order term. Such systems 
cannot exhibit a continuous transition. To see why, let us write the free energy 
in the form 


cf)(T, Y , r;) = <f) 0 (T , Y ) + a 2 T] 2 + a 3 r] 3 + a 4 rf H , (3.79) 

where a 2 = a 2 (T , Y),ot 3 = a 3 (T, Y), and a 4 = a 4 (T, Y). 

The extrema of the free energy are given by the equation ( d(f>/drj) T Y = 
which has solutions 77 = 0 and 77 = ((— 3a 3 ± y/9a \ — 32a 2 a 4 )/&a 4 ). As long 
as 9^3 - 32a 2 a 4 < 0, the only minimum of the free energy occurs at 77 = 0 
because other values of 77 will be complex and therefore unphysical. When 
9a3 — 320:20:4 > 0, two minima and one maximum can exist. A plot of the free 
energy for o 2 > 0, 04 > 0, and a range of values of 03 is given in Fig. 3.24. For 
curves A, B, and C the state of minimum free energy (the equilibrium state) 
occurs for 77 = 0. Curve D shows the behavior of the free energy at the critical 
temperature for this system. At this point the equilibrium state shifts from one 
with order parameter 77 = 0 to one with order parameter 77 = rj D . Therefore, this 
is a discontinuous transition and is of the type one expects for a first-order 
transition. The transition point for the first-order transition (curve D in Fig- 
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Fig. 3.24. The behavior of the free energy, (f> = airj 1 + a^rf + a 4 rf, for ot'i = 2.0, 
a 4 = 4.0, and (A) a 3 = -4.5, (B) a 3 = -5.333, (C) a 3 = -5.5, (D) a 3 = -5.6568, 
and (E) a 3 = -5.85. In the figure, t]e = 0.7738. 


3.24) is easily found. It must satisfy the conditions (f) — (j ) o = 0 and 
(d(f)/dr]) TY = 0. These two conditions give rj = —Cf$/2a 4 and a .2 = a\/Aa 4 . 
Therefore’ the first-order transition occurs when a .2 > 0, and therefore it occurs 
before any continuous transition as the temperature is lowered. If a 3 < 0, then 
it occurs for a positive value of the order parameter. If 0:3 > 0, then it occurs for 
a negative value of the order parameter. 


3.H. CRITICAL EXPONENTS [27-29] 

The critical point is the point at which the order parameter of a new phase 
begins to grow continuously from zero. We have seen many examples of critical 
points. In the liquid-vapor transition, a critical point terminated the liquid- 
vapor coexistence curve and was the one point for which the free energy slope 
changed continuously. For the binary mixture, the critical point marked the 
temperature at which phase separation could first take place as we lowered the 
temperature. In the spin system the critical point for transition from a 
paramagnetic state to ferromagnetic state was the Curie point. In the 
superconductor, the critical point was the point at which the condensate first 
appeared as we lowered the temperature in the absence of an external field. In 
liquid He 4 , the condensed and normal phases were separated by a line of critical 
Points in the (P, T) plane. 

Systems exhibit dramatically new behavior below the critical point. As we 
approach the critical point from above (higher temperature), the system 
anticipates its new behavior by making “adjustments” on a microscopic scale. 
These “adjustments” appear in the form of fluctuations in density, magnetiza- 
tion, and so on, which become very large as the critical point is approached. Just 
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below the critical point, the order parameter of the new phase first becomes 
nonzero. 


3.H.I. Definition of Critical Exponents 

A subject of great interest is the way in which various systems approach the 
critical point. For this purpose it is useful to introduce the idea of critical 
exponents. As one approaches the critical point, various thermodynamic 
functions may diverge or go to zero or even remain finite. It is therefore 
convenient to introduce an expansion parameter 


T-T c 



(3.80) 


where T c is the critical temperature, which is a measure of the distance from the 
critical point in terms of reduced variables. 

Near the critical point all thermodynamic functions can be written in the 
form 


/(e) = Ae A (l +*£>' + •••) (3.81) 


where y > 0. The critical exponent for the function /(e) is defined 


A = lim 

c — *0 


In /(g) 

lne 


(3.82) 


If A is negative, /(e) diverges at the critical point. If A is positive, /(e) goes 
to zero at the critical point. In Fig. 3.25 we have plotted /(e) = e A 
for A == — 1 , — 3/2, 1/2, and 1/4. The case where A = 0 may correspond to 
several different possibilities; for example, it may correspond to a logarithmic 
divergence /(e) = A|lne| +B or to a dependence on e of the form 
/(e) = A + Z?e 1 / 2 . For such cases a modified exponent is introduced. If j is 
the smallest integer, such that d j f(e) jde 7 = /^ (e) diverges, then 


A' = j + lim 

e — >0 


In|/%)l 

lne 


(3.83) 


In Fig. 3.26 we have plotted /(e) as a function of e for some cases when A = 0. 
Although we have chosen to write e in terms of temperature, it is also of interest 
to introduce exponents for the approach to the critical point involving quantities 
other than temperature, such as pressure, density, magnetic field, and so on. 
Thus, there are a number of different critical exponents that can be defined for a 
system depending on how the critical point is approached. We shall give some 
examples in the next section. 
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0 0.05 0.10 0.15 0 0.05 0.10 0.15 

Fig. 3.25. Plots of /(e) = e A for cases when A f 0 (a) Plots for A = -1 and A = — 
(b) Plots for A = | and A = \ . 


/(e) = HOI /(e) = I'**' 



Fig. 3.26. Plots of /(e) = e A for cases when A = 0, (a) Plot of /(e) = | In (e) | . (b) Plot 
of /(e) = 1 - e x ' . 


3.H.2. The Critical Exponents for Pure PVT Systems 

There are four critical exponents which are commonly used to describe the bulk 
thermodynamic properties of PVT systems. We define them and give their 
experimental values below. 

(a) Degree of the Critical Isotherm. The deviation of the pressure ( P — P c ) 
from its critical value varies at least as the fourth power of (V — V c ) as the 
critical point is approached along the critical isotherm. It is convenient to 
express this fact by introducing a critical exponent, 6, such that 

^r^ A6 f-pr \ sign(p_pc) ’ t=t ‘’ 


(3.84) 
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where P c is the critical pressure, p c is the critical density, At is a constant, and 
P c ° is the pressure of an ideal gas at the critical density and temperature. 
Experimentally it is found that 6 > <5 > 4. The exponent 6 is called the degree 
of the critical isotherm. 

(b) Degree of the Coexistence Curve. Guggenheim [ 6 ] has shown that the 
deviation (T — T c ) varies approximately as the third power of (V — V c ) as the 
critical point is approached along the coexistence curve from either direction. 
One expresses this fact by introducing a critical exponent (3 , such that 

^l = A 0 (-e) li , (3.85) 

Pc 

where pi is the density of liquid at temperature T < T c , p g is the density of gas 
at temperature T <T C , each evaluated on the coexistence curve, and Ap is a 
constant. The quantity pi — p g is the order parameter of system. It is zero above 
the critical point and nonzero below it. The exponent /3 is called the degree of 
the coexistence curve and is found from experiment to have values / 3 « 0.34. 


(c) Heat Capacity. The heat capacity at constant volume appears to have a 
logarithmic divergence for T -+ T c along the critical isochore (V = Vc). The 
critical exponent for heat capacity is denoted a and is defined as follows: 


(A' a (-e)~_ a \ T<T C) P = p c , 
\ A a (+e) Q , T > T c , p = pc, 


(3.86) 


where A' a and A a are constants. The exponents a and a' are found experi- 
mentally to have values a ~ 0.1 and a' ~ 0 . 1 . 


(d) Isothermal Compressibility. The isothermal compressibility diverges 
approximately as a simple pole: 


T < T c , p = Pl (T) or p g (T) 
\A 7 (e) 7 , T>T C , p = p c , 


(3.87) 


where A 1 and A 7 are constants. For T <T C one approaches the critical point 
along the coexistence curve; for T > T c one approaches it along the critical 
isochore. Typical experimental values of f and 7 are 7 ' ~ 1.2 and 7 ~ 1.3. 


(e) Exponent Inequalities. It is possible to obtain inequalities between the 
critical exponents using thermodynamic arguments. We shall give an example 
here. Equation (3.40) can be rewritten in terms of the mass density as 


x g T ( dp g \ 2 

Cv = x g c Vg + xic Vl + J 


xiT ( dp t \ 2 
+ p ]k 1 t \dT ) 


coex 


coex 


(3.88) 
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where c v , c Vg and c Vl are now specific heats (heat capacity per kilogram), and kj 
is the isothermal compressibility. All terms on the right-hand side of Eq. (3.88) 
are positive. Thus, we can write 


. > *gT (dp s \ 2 

v " P y T 


(3.89) 


As the critical point is approached for fixed volume, x g — > (1/2), p g — > p c (p c 
is the critical density), kj diverges as (T c - T)~^ [cf. Eq. (3.87)], and 
(< dp g /dT) coex diverges as (T c - T) p ~ l if we assume that [(l/2)(p z + p g ) - p c \ 
goes to zero more slowly than {pi — p g ) [cf. Eqs. (3.26) and (3.27)]. Thus, 


1 T c B{T C - T) y+2p ~ 2 

Cv ^ zz -j 


where B is a constant, and 

In c v ^ (2 - 7 ' - 2/3)| ln(-c)|. 


(3.90) 


(3.91) 


If we next divide by | ln(— £)| and take the limit T — ► T c , we find 

a' + 2/3 + j'>2. (3.92) 


The inequality in Eq. (3.92) is roughly satisfied by real fluids. If we choose 
a' = 0.1,P = (1/3), and 7 ' = 1.3, then a' + 2(5 + 7 ' « 2. Equation (3.92) is 
called the Rushbrook inequality. 

The critical exponents can be computed fairly easily starting from mean field 
theories such as the van der Waals equation (cf. Exercise (3.4)) or Ginzburg- 
Landau theory. All mean field theories give similar results. The common feature 
of these theories is that they can be derived assuming that the particles move in 
a mean field due to all other particles. The mean field theories do not properly 
take into account the effects of short-ranged correlations at the critical point and 
do not give the correct results for the critical exponents. We shall return to this 
point when we discuss Wilson renormalization theory of critical points. In 
Section S3.C, we define the critical exponents for the Curie point in a magnetic 
system. 


I EXERCISE 3.4. Compute the critical exponents, a , 6, and 7 for a gas 

whose equation of state is given by the van der Waals equation. 

Answer: The van der Waals equation in terms of reduced variables is 
(R + (3/v 2 ))(3v— 1) = 87\ In order to examine the neighborhood of the 
critical point, we introduce expansion parameters e = ( T/T c ) — 1, 
I ^ = (v/v c ) — 1 , and 7 r = (P/P c ) — 1 - In terms of these parameters, the 
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van der Waals equation can be written 


(1 + 7r) + 


3 

(i+w ) 2 


[3(cj + 1) — 1] — 8(1 + s). 


If we solve for n, we find 


8 £ + 16 £CJ + 8 £LJ 2 — 3 LJ 3 
2 + Icj + 8w 2 + 3 cj 3 


(1) 

(2) 


(a) The degree of the critical isotherm , 6. Let e = 0 in Eq. (2) and expand 
7 r in powers of u. This gives 

7T = -^ 3 + ---. (3) 

Thus, the degree of the critical isotherm is 6 = 3. 

(b) The isothermal compressibility exponent, 7 . Let us compute (dn /duJ) £ 
and then set cj = 0. We obtain 



( 4 ) 


for cj = 0. Thus, the critical exponent, 7 , is 7 = 1 . 

(c) The degree of the coexistence curve, (3. In the neighborhood of the 
critical point, the van der Waals equation can be written 

3 

7r = 4e — 6ecj + 9 eu 2 — -cj 3 -j . (5) 


The values of u on either side of the coexistence curve can be found 
from the conditions that along the isotherm, 


vdP = 0 and E(v^) = P(vi), ( 6 ) 

where P = P/P c is the reduced pressure and v g = v g /v c and 
v/ = vijv c are the reduced molar volumes of the gas and liquid, 
respectively, on the coexistence curve. The condition that the pressure 
and temperature of the liquid and gas in the coexistence region be 
equal yields 

-\cj] - 6ecJi + 9 euj = -§u ; 3 - 6 eu g + 9 ecj 2 g . (7) 

If we note that ui is negative and write cji = —ui and cj 8 = +u g , then 
Eq. (7) gives 

4e(o >/ + uj g ) + 6e(u)f - cjg) -h cj 3 + cj 3 = 0. 


(8) 
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The fact that the molar Gibbs free energy of the liquid and gas phases 
are equal means that J y v ' vdP = 0 or that 


•W/ 

dui^ 1 u) 

Jlj 0 


—6e + 18 uje — -ur + 


-6 e(ui - u g ) + 6e(uj - uj 2 g ) - - [u] - u 3 g ) = 0. 


4e(d)/ + Ug ) + 4e(u^ — u + u 3 + u 3 — 0. (10) 

In order for Eqs. (8) and (10) to be consistent, we must have u g = <1 )/. 
If we plug this into Eq. (8) or (10), we get ui = u g = u. This gives 
u 2 « — 4e. Note that e is negative. 

Thus, 

u g « ui = 2\e\ 1 ^ (11) 

and the degree of the coexistence curve is (3 = 1/2. 

(d) The heat capacity exponent, a. The jump in the heat capacity can be 
obtained from Eq. (3.40). Let us approach the critical point along the 
critical isochore, v = v c . Then as T — ► T c ,xi « x g — ► 1/2 and 
c v — ► c Vi . Thus, the jump in the heat capacity is given by 


c Vc ( T c ) - c v e ( T c) = Jim < - x -zr h»F 


Along the coexistence curve ( dv/dT) coex = ±\e\~ x ^ 2 , where the 
minus sign applies to the liquid and the plus sign applies to the gas. 
From Eqs. (5) and (11) we find 


@) 7 = ©)r' 6N “^ 2 “' M£| + '" (13) 

= -12|e| ±0(|e| 3/2 ). 

If we note that ( P c v c /RT c ) = (3/8), we find 

cv ,( 27 ) - c ^i T t) =\r + o(t). (14) 

Thus, the van der Waals equation predicts a finite jump in the heat 
capacity at the critical point and therefore it predicts a' = a = 0. 
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^ SPECIAL TOPICS 
^ S3.A. Surface Tension 

In Section 2.H.1, we showed that two parts of a thermodynamic system, at 
equilibrium, must have the same pressure if the interface between them is free 
to move and transmit mechanical energy. In deriving the condition for 
mechanical equilibrium however, we neglected any contributions due to surface 
tension. In this section we shall show that, if the surface moves in such a way 
that its area changes, surface tension can affect the mechanical equilibrium 
condition. 

For simplicity, let us consider a monomolecular thermodynamic system 
consisting of two phases, gas (vapor) and liquid in contact and in thermo- 
dynamic equilibrium. A typical example would be water droplets in contact 
with water vapor. As we discussed in Section 2.C.6, very strong unbalanced 
molecular forces at the liquid surface act to contract the surface. If the liquid 
droplet grows and the surface area increases, then work must be done against 
these surface forces. This provides an additional contribution to the mechanical 
energy of the system. 

In order to obtain corrections to the condition for mechanical equilibrium 
due to surface effects, let us consider a single liquid droplet of radius, R, 
floating in gas (we neglect gravity). We assume that the entire system is held at 
temperature T and is contained in a fixed total volume, Vtot = V g + Vi, where 
V[ = (4/3)7 tR 3 is the volume of the liquid droplet and V g = Vtot — V/ is the 
volume of the gas phase (cf. Fig. 3.27). The surface area of the droplet is 
A = AnR 2 and its surface tension is a. The number of molecules in the gas 
phase is N g and in the liquid phase is Ni. The number of molecules comprising 
the surface, N s , generally will be very small compared to those in the liquid or 



Fig. 3.27. A single liquid droplet in equilibrium with its vapor. 
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gas phases, so we take N s « 0. Since the gas and liquid phases are free to 
exchange particles and heat, at equilibrium the chemical potentials jl g and jli, of 
the gas and liquid phases respectively, will be equal and the temperature will be 
uniform throughout. However, the pressures of the gas and liquid phases need 
not be equal if surface effects are included. Since we are dealing with a 
monomolecular substances, the chemical potential will be a function only of 
pressure and temperature. Therefore, at thermodynamic equilibrium we have 

m(P h T) = fi g (P g ,T). (3.93) 

Because we are dealing with a system at fixed temperature, total volume, and 
chemical potential, it is convenient to work with the grand potential (cf. Section 
2.F.5). The grand potential for the entire system can be written 

ft = -P g V g - PiVi + oA = -(V tot - f 7 tR 3 )P 8 - f 7i tR 3 Pi + a4nR 2 . (3.94) 

For a system at equilibrium with fixed temperature, total volume, and chemical 
potential, the grand potential must be a minimum. Therefore, we obtain the 
condition for thermodynamic equilibrium: 

(llf) = 4jr/f 2 (P* - Pi) + 8*rJ?<7 = 0 (3.95) 

\dRJ T Vtot J1 


or 


P,-P g = ^- (3.96) 

If the interface between the two parts of a thermodynamic system has a surface 
tension and if the surface has curvature, then the condition for mechanical 
equilibrium must be corrected to include the effects of surface tension. The 
pressures in the two parts of the system need no longer be equal. The surface 


Table 3.1. The Surface Tension for Various 
Liquids in Contact with Air 


Substance 

<7 (dynes/cm) 

T(°C) 

Water 

72.0 

25 

Water 

67.9 


Alcohol 

21.8 

25 

Alcohol 

19.8 


Soap solution 

25.0 

25 

Mercury 

487 

15 
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tension varies with temperature and generally decreases as the temperature is 
raised. The addition of impurities to the liquid will reduce the surface tension 
because it disrupts the tight network of “unbalanced” intermolecular forces at 
the interface. Some values of the surface tension of several substances are given 
in Table 3.1. Equation (3.96) was obtained by thermodynamic arguments. In 
Exercise 3.5, we show that it can also be obtained by purely mechanical 
arguments. 


■ EXERCISE 3.5. Use purely mechanical arguments to derive Eq. (3.96) 
for a spherical liquid droplet, with radius R, floating in a gas. Assume that 
the liquid has pressure Pi, the gas has pressure P g , and interface has surface 
tension a. 

Answer: Consider the forces acting on an area element, dA — 
r R 2 sin (0)d0d(j), located in the interval 6 —> 0 + dO and 0 — *• 0 4- d(f) on 
the surface of the droplet. There will be a force outward (inward) along the 
direction, r, due to the pressure of the liquid (gas). The net force on the area 
element due to the pressure difference is 

d F P = (Pi — P g )rR 2 sin OdOdcf), ( 1 ) 

where r = sin(0) cos(0)x + sin(0) sin(0)y 4- cos(0)z. The surface tension 
exerts force on the edges of the area element which are tangent to the surface 
and perpendicular outward at each edge, and it tends to stretch the area 
element in all four directions (note that the units of surface tension are 
(work/area)=(force/length)). The side, (#—►# + dO, 0), of the area element, 
dA, experiences a force 

di\ = —yoRdO, (2) 

where RdO is the length of the side and the unit vector, <p, can be written 
cp = — sin(0)x + cos(0)y. The side, (0 — > 0 + dO, 0 + dcj)), of the area 
element, dA, experiences a force 

df 2 = (0 + d<f>)aRdO, (3) 

where RdO is the length of the side and the unit vector, 0 + dcj), can be 
written cp + dtp = — sin(0 + dcf>)x + cos (0 + d(f>) y. The side, ( 0 , 0 — >• 0+ 
d<f>), of the area element, dA, experiences a force 

dfi = — 0crtfsin(0)d0, (4) 

where Rsin(0)d(j) is the length of the side and the unit vector, 0, can be 
written 0 = cos(0) cos (0)x + cos(0) sin(0)y + sin(0)z. The side, (0 + dO , 
0 _ > 0 -f d(j)), of the area element, dA, experiences a force 

d f4 = (0 + dQ)crR sin(0 + d0)d<p , 


( 5 ) 
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where R sin(0 + dO)d(f) is the length of the side and the unit vector, 0 + d0, 
can be written 0 + ^0 = cos(0 + dd) cos (0)x + cos(0 + dO) sin(0)y+ 
sin(0 -I- dd) z. If the droplet is to be in equilibrium, these forces must add 
to zero. 

| The condition for mechanical equilibrium takes its simplest form if we 
integrate these forces over the top hemisphere of the droplet — that is, if we 
{ integrate the forces over the interval 0 ^ 0 ^ 7t/2 and 0 ^ 0 ^ 2n. [Before we 
| integrate, we must expand Eqs. (3) and (5) and only keep terms of order 
I dOd<f>\. The integration over (f> causes all contributions in the x and y 
I directions to drop out and we get 

| 1 [d F P + d f i + d f 2 + d f 3 + d f 4 ] = [(/»/ - P g )nR 2 - 2ttc7/?]z. (6) 

' J hemisphere 

j If the droplet is to be in equilibrium, these forces must add to zero. 
Therefore, 

: (P l -P g )R-2a = 0, (7) 

f and we obtain the same condition for mechanical equilibrium as was 
obtained from thermodynamic arguments. 


It is interesting to obtain the equilibrium vapor pressure of the gas as a 
function of the radius of the droplet. Let us assume that a reversible change in 
the pressure of the liquid and gas occurs in such a way as to keep the system in 
equilibrium and keep the temperature fixed. Since we are dealing with a 
monomolecular substance, we know that reversible changes in the chemical 
potential of the liquid and vapor can be written dfx' = vdP — sdT, where 
v = V/N is the volume per particle and s = S/N is the entropy per particle. If 
the radius of the droplet changes, keeping the droplet in equilibrium with the 
vapor and keeping the temperature fixed ( dT = 0), then d^ = drf g and 

vidPi = v g dP g , (3.97) 

where v/ = VjNi and v g = V g /N g . However, from Eqs. (3.96) and (3.97), we 
can write dPi — dP g = d(2a/R) and 



(3.98) 


Let us now restrict ourselves to the case when the volume per particle of the 
liquid is much smaller than that of the gas (far from the critical point) so that 
v i <^v g . Then we can approximate the volume per particle of the gas by the 
ideal gas value, v g = {kpT /P g ), and Eq. (3.98) takes the form 


k B T dP g __ d 


v/ Pg 


2a 

~R 


(3.99) 
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Fig. 3.28. A plot of the equilibrium vapor pressure of the droplet as a function of the 
radius of the droplet. 

From Eq. (3.99), we can now find how the equilibrium vapor pressure of the 
gas varies as a function of the radius of the droplet. We will neglect changes in 
v/ as the pressure changes because liquids are very incompressible. Let us 
integrate the right-hand side of Eq. (3.99) from R = oo to R and the left-hand 
side from P g { oo) to P g {R). Then we find 

PgW » p g(°°) ex p(j^)- ( 3 - 100 ) 

The quantity P g ( oo) is the vapor pressure when the interface has no curvature 
(i.e., is flat). For a curved interface, the vapor pressure must be greater to 
maintain the droplet in equilibrium with the vapor then. Equation (3.100) tells 
us that only droplets of a given size can exist in equilibrium with a vapor at 
fixed P and T. Drops of different size will be unstable. 

A plot of the equilibrium vapor pressure as a function of the radius of the 
droplet is given in Fig. 3.28. For a given radius, R — Ro, the equilibrium vapor 
pressure is P g (Ro). If there happens to be a droplet with radius, R > Ro, then the 
pressure, P g (Ro), is too high and the droplet will absorb fluid in an effort to 
reduce the pressure of the gas and eventually will condense out. If R < Ro, the 
pressure of the gas is too low and the droplet will give up fluid in an effort to 
increase the pressure and will disappear. As a result, the droplets tend to be a 
uniform size. 


► S3.B. Thermomechanical Effect [30] 

The behavior of liquid He 4 below 2.19 K can be described in terms of a model 
which assumes that liquid He 4 is composed of two interpenetrating fluids. One 
fluid (superfluid) can flow through cracks too small for He 4 gas to leak through, 
and it appears to carry no entropy. The other fluid (normal fluid) behaves 
normally. The fact that the superfluid carries no entropy leads to some very 
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Fig. 3.29. Two vessels containing liquid He 4 below 2.19 K and connected by a very fine 
capillary. Only superfluid can pass between the two vessels. 

interesting behavior, some of which can be described with classical ther- 
modynamics. Let us consider two vessels, A and B, filled with liquid He 4 at a 
temperature below 2.19 K, and let us assume that they are connected by a 
capillary so thin that only the superfluid can flow through it (cf. Fig. 3.29). Let 
us further assume that the vessels are insulated from the outside world. This 
means that the total entropy must remain constant if no irreversible processes 
take place. Let us also assume that the total mass and total volume of the system 
remain constant. Under these conditions, the equilibrium state is a state of 
minimum internal energy. 

We can obtain the condition for equilibrium between the vessels if we 
assume that matter can flow between them but entropy cannot. The total internal 
energy will be denoted U T , and M/ will denote internal energy per kilogram 
(specific internal energy) in vessel /. The total internal energy is then given by 

U t =Y^ (3.101) 

l=A,B 

where M/ is the total mass of liquid He 4 in vessel l. At equilibrium, the total 
internal energy must be a minimum. Thus, 

SU T = 0 = ( Ul6Ml + M i Su i)- (3.102) 

l=A,B 

Let us now assume that the total volume, Vi, and the total entropy, Si, of liquid 
He in vessel l (for l — A,B) are constant (this is possible because only 
superfluid can flow between the vessels and superfluid carries no entropy). The 
entropy of liquid He 4 in vessel l can be written Si = M/5/, where M/ and 5/ are 
the total mass and specific entropy of liquid He 4 in vessel l. Similarly, 
V/ = M/v/, where v/ is the specific volume of liquid He 4 in vessel /. Since 
Si and V/ are constants, we can write 6Si = M/fo/ + stfMi = 0 and 6V[ = 
M/<5v/ + viSMi = 0. Therefore, 
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and 


SM t 

ovi = -V/ — . 

Ml 


(3.104) 


Let us now expand the differential, Sui, in Eq. (3.102) in terms of specific 
entropy and specific volume. Equation (3.102) then takes the form 



(3.105) 


If we note that ( du/ds) v = T and ( du/dv) s = — P [cf. Eqs. (2.68) and (2.69)1 
and make use of Eqs. (3.103) and (3.104), we obtain 

(«i - s,T t + v,P,)SM, = = 0, (3-106) 

l=A,B l=A,B 

where /}/ is the specific chemical potential of liquid He 4 in vessel /. Since the 
total mass is conserved, we can write 6M A = —6Mb and we obtain the 
equilibrium condition 


MTa,Pa) = MT b ,P b ). (3.107) 

Since atoms can be exchanged, the chemical potentials of liquid He 4 in the two 
vessels must be equal. However, since heat cannot be exchanged and volume 
cannot change (no mechanical energy transfer), the pressure and temperature of 
the two vessels need not be the same. 

We can now vary the temperature and pressure in one of the vessels (vessel 
A, for example) in such a way that the two vessels remain in equilibrium. The 
change in chemical potential in vessel A is 


A£l a = -sAT a + vA P A , (3.108) 

where s = —(dfi/dT)p and v = (dp,/dP) T . But to maintain equilibrium, we 
must have A \1a — 0 so the chemical potentials of the two vessels remain equal. 
Therefore, 


AP a =-AT a . (3.109) 

v 

Thus, a change in temperature of vessel A must be accompanied by a change in 
pressure of vessel A. If the temperature increases, the pressure will increase. 
This is called the thermomechanical effect. 

The thermomechanical effect is most dramatically demonstrated in terms of 
the so-called fountain effect. Imagine a small elbow tube filled with very fine 
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Fig. 3.30. The fountain effect. 


powder, with cotton stuffed in each end. Assume that a long, thin capillary tube 
is put in one end and the elbow tube is immersed in liquid He 4 at a temperature 
below 2.19K. If we now irradiate the elbow tube with a burst of light, the 
pressure of helium in the elbow tube will increase and helium will spurt out of 
the capillary tube (cf. Fig. 3.30). This is called the fountain effect and is a 
consequence of Eq. (3.109). When helium in the elbow tube is heated by 
radiation, superfluid will flow into the elbow tube to equalize the chemical 
potentials and increase the pressure in the elbow tube. It is interesting that in the 
fountain effect there is a spontaneous flow of matter from a cooler to hotter 
vessel. However, since the superfluid does not carry entropy, this does not 
violate the second law. 


► S3.C. The Critical Exponents for the Curie Point 

For magnetic systems, exponents a, /?, 7 , and 6 can be defined in analogy with 
pure fluids. The phase diagrams for simple magnetic systems are given in 
Figs. 3.31-3.33. In Fig. 3.31, we sketch the coexistence curve for a fer- 
romagnetic system. Below some critical temperature the spins begin to order 
spontaneously. The coexistence curve separates the two directions of 
magnetization. In Fig. 3.32 we plot some isotherms of the magnetic system, 
and in Fig. 3.33 we plot the magnetization as a function of temperature. It is 
helpful to refer to these curves when defining the various exponents. 

(a) Degree of the Critical Isotherm. The exponent 6 describes the variation of 
magnetization with magnetic field along the critical isotherm 
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Fig. 3.31. Coexistence curve for a typical magnetic system. Below the Curie point the 
magnetization occurs spontaneously. The curve H = 0 separates the two possible 
orientations of the magnetization. 


M 



Fig. 3.32. A sketch of the isotherms for a ferromagnetic system. 


where = k B T/mo, Mo(0) is the magnetization in zero field at zero 
temperature, mo is the magnetic moment per spin, and B$ is a proportionality 
constant. Experimentally, 6 has values 4 < 6 < 6 in agreement with the values of 
6 for pure fluids. 


(b) Magnetization Exponent. In a magnetic system, the exponent (3 describes 
how the magnetization approaches its value at the critical point when no 
external field is present. It is defined as follows: 


M 0 (T) 

M 0 (0) 




where Bp is a constant. For magnetic systems, /? « \ as it is for fluids. 


(3.111) 
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AT 



Fig. 3.33. A sketch of the magnetization 
for a simple ferromagnetic system. 


(c) The Heat Capacity. For magnetic systems, the coefficients a and a' are 
defined as follows: 


C„(H = 0) 


K 

[B a 


(-«)' 


T<T C , 

T>T C , 


(3.112) 


where B„ and B? a are constants. Experimentally, one finds a ~ a' ~ 0. 


(dl The Magnetic Susceptibility. The magnetic susceptibility in the vicinity of 
the critical point can be written 


xi fa ;(-e)' y , r < r c , h = o, 

X°t \B 1 e-', T > T c , H = 0, 


(3.113) 


where and # 7 are constants and Xt * s fhe susceptibility of a noninteracting 
system at the critical temperature. For real systems, 7 has been found to be 
7 ~ 1.3. 

The striking feature about the critical exponents for fluids and for magnetic 
systems is that the values are roughly the same. Indeed, there appears to be a 
great similarity in the way in which many systems approach their critical 
points. 


► S3.D. Tricritical Points 

Ginzberg-Landau theory allows us to describe a tricritical point (a point where 
a line of A points meets the critical point for a line of discontinuous transitions). 
Let us consider a free energy of the form 

<HT, y, V) = MT, Y ) + <*2T?(T, Y) + c M v\T , Y) + a 6 rf(T, T) H , 

(3.114) 
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where a 2 = a 2 (T, y),<24 = <24 (T, Y ), and ae = Oie(T, y).The extrema of the 
free energy are given by 

(^77) — 2c * 2 77 + 4Qf 4 ?7 3 + 6c2 6 r? 5 = 0 . ( 3 . 115 ) 


This has solutions 


77 = 0 and 


2 -C24 ± \ jo & — 3C22C26 

" = 3al 


(3.116) 


Because there is no cubic or fifth-order term in the free energy, we can have a 
regime in which there is a line discontinuous transitions and a regime in which 
there is a line of continuous phase transitions. The conditions that must be 
satisfied for a discontinuous transition to occur are (j) — 0o = 0 and 
(d(j)/drj) TY = 0. These two conditions give 77 2 = (the discontinuous 

transition can only occur for 724 < 0 so rj 1 is non-negative) and 

(3 - n7) 

The line of discontinuous transitions is given by the values of T and Y which 
satisfy Eq. (3.117). The line of continuous phase transition is given by the 
condition 


a 2 (T,Y) = 0. (3.118) 



Fig. 3.34. A plot of the neighborhood of the tricritical point in the ( 7 ; Y) plane. The 
point t connecting the line of continuous phase transitions (dotted line) and the line ol 
first-order phase transitions (solid line) is a tricritical point. It is the meeting point of 
three phases. 
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The lines of discontinuous and continuous phase transitions meet at a point, 
(' T t ,Y t ), determined by the conditions a 2 (T t ,Y t ) = 0 and a 4 (T h Y t ) — 0. A 
schematic picture is given in Fig. 3.34. The point, t, was called a tricritical 
point by Griffiths [25]. He showed that in a suitable space it is the meeting point 
of three lines of critical points (two of them associated with the discontinuous 
transition). 

A tricritical point occurs in He 3 -He 4 mixtures (cf. Fig. 3.19). It is 
simultaneously the critical point for the binary phase transition and the end 
point of a line of A points associated with superfluid phase transition. 


^ S3.E. Binary Mixtures [2, 31-33] 


If we consider a fluid which is a mixture of several different types of interacting 
particles, a phase transition can occur in which there is a physical separation of 
the fluid into regions containing different concentrations of the various types of 
particles. The simplest example of this type of phase transition occurs for binary 
mixtures. 

It is useful first to obtain a number of thermodynamic relations applicable to 
all binary mixtures. The Gibbs free energy for a binary mixture composed of n\ 
moles of type 1 particles and n 2 moles of type 2 particles is 

G = G{T,P,n\,n 2 ) = «iAO +n 2 p 2 , (3.119) 

and differential changes in the Gibbs free energy can be written 

dG = —SdT + VdP + p\dn\ + p 2 dn 2 . (3.120) 

The molar Gibbs free energy, g = G/n(n = n\ + n 2 ), is 

g = X l/il+* 2 /22, (3.121) 

where x\ and x 2 are the mole fractions of particles of type 1 and 2, respectively. 
From Eqs. (3.120) and (3.121), it is easy to show that 


dg — —sdT -j- vdP + (pi — p 2 )dx\, 

SO that g = g(T, P,x\). 

The chemical potential of type 1 particles is 

' ,,= (S) w , = * +(i - je,) (S ), i1 

and the chemical potential of type 2 particles is 


P2 


dG\ 

dn 2 ) 


P,T,n\ 


( dg\ 

=8 ~ xi Wj P y 


(3.122) 


(3.123) 


(3.124) 



154 


THE THERMODYNAMICS OF PHASE TRANSITIONS 


where we have used the fact that 


= 2 and = -*. 

\drnJn 2 n \dn 2 J nx n 


(3.125) 


From Eqs. (3.122)-(3.124), we see that the chemical potential depends on the 
mole numbers n\ and n 2 , only through its dependence on the mole fraction, x\. 
We can also show that 


and 



= (1 -*0 
P,T 



(3.126) 




(3.127) 


► S3.E.1 . Stability Conditions 

In Section 2.H.2, we found that chemical stability and Maxwell’s relations 
require that the matrix 


/Ml,l Ml.2^ 

\M2,1 M2, 2/ 

must be symmetric positive definite, where //,-j = (dfii/dnj) PT ^ nk ^ nj y This 
requires that fi ifi > 0 (/ = 1,2) and that every principal minor be positive or 
zero. Thus, for a binary mixture we must have 

Mi,i>0, M2,2>0, detr* 1,1 /i1,2 ) > 0, and ^ 1,2 = M2, l- (3.128) 

\M2,1 M2, 2/ 

For processes that occur at constant T and P, we have some additional con- 
ditions. From the Gibbs-Duhem equation (2.62), for processes in which dT = 0 
and dP = 0 we have 

n\[d^\) PT + n 2 [dfi 2 ] PT = 0. (3.129) 

Furthermore, we can write 



dn i + 

P,T,n 2 



dn 2 


[dp>\ ]pj — 


P,T,ni 


(3.130) 
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and 


\dfi2\pj — 



dn 1 -+■ 

P,T,n 2 



dn 2 . 

P,T,n 1 


(3.131) 


If we combine Eqs. (3.129)— (3.131) and use the fact that the differentials dn\ 
and dn 2 are independent, we find that 


n 2 



P,T,n\ 



and 


n\ 



+w 2 

P,T,n 2 



(3.132) 


(3.133) 


Thus, if /ii,! >0, then fi 2 ,i < 0, then for binary systems the conditions for 
chemical stability for a process at constant T and P reduce to 

(P) >0 - fir) >0 - fir) = fir) <°- 

\dn\J PTri2 \dn 2 ) P j,m \^ n ^J pj,n 2 \^ n 2 J pj m 

(3.134) 


From Eq. (3.127), the molar Gibbs free energy must satisfy the condition 




(3.135) 


Thus, for chemical stability, the molar Gibbs free energy must be a convex 
function of the mole fraction. 


^ S3.E.2. Equilibrium Conditions 

In Section 2.H.2, we found that for a binary mixture to be in equilibrium the 
pressure P, the temperature T, and the chemical potential of each type of particle 
must be equal (uniform) throughout the system. The chemical potentials of 
different types of particles need not be equal. 

If a phase separation occurs in a binary mixture, the chemical potential of 
type 1 particles must be equal in the two phases and similarily for the chemical 
Potential of type 2 particles. This equality of chemical potentials between the 
two phases gives us a condition for locating the coexistence curve. For type 1 
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particles we have fi[ = fi 1 * or 




\dxj 


P,T 


\dxj 


pj 


and for type 2 particles we have — fj.% or 


/ //^V n n(dg\ U 

V-x\ — =g"-x \ ( — , 

\OXl J p T \OX 1 ) p T 


(3.136) 


(3.137) 


where I and II denote the two phases. If we combine Eqs. (3.136) and (3.137), 
we can write the conditions for equilibrium in the form 


and 


dgV = ( dg\ U 

dx\ / p j 1 / p T 


(3.138) 



(3.139) 


Equations (3.138) and (3.139) tell us that the equilibrium points in the (g,xi) 
plane have equal slopes, and they have a common tangent. Equations (3.138) 
and (3.139) locate the coexistence curve. 

In Fig. 3.35 we show a sketch of the molar Gibbs free energy which 
illustrates these various properties. It shows a region where two phases can 


9 



Fig. 3.35. A sketch of the molar Gibbs free energy of a phase separated binary mixture. 
The two points with common tangent are equilibrium states. 
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coexist. The points that have a common tangent are the equilibrium points. The 
concave region in the middle is unstable. In this region the two phases, one rich 
type 1 particles and the other rich in type 2 particles, can coexist. In a test tube, 
the heavier phase will sink to the bottom and the lighter one will float to the top. 

As long as (d^/dx \) PT < 0, the binary mixture will be stable and exist in 
one phase. However, if (d\i 2 /dx i) p T > 0 the system has an unstable region and 
phase separation occurs. The critical point for this phase separation is given by 
\diii/dx\) c PT = 0. The critical point is the point where the jci first becomes a 
double-valued function of [i\ or fi 2 as the temperature is changed. That is, two 
different values of jci give the same value of the chemical potential. Thus, in 
analogy to the liquid-vapor critical point (with P — > and v — * jci), the critical 
point is a point of inflection of the curve = fi 2 (T,P,xi) for T and P 
constant. Therefore, we have the additional condition that (d 2 ^ 2 / dx\) c p T = 0 at 
the critical point. 

A sketch of the coexistence curve, and the curve separating stable from 
unstable states is given in Fig. 3.36. The region outside and above the co- 
existence curve corresponds to allowed single-phase equilibrium states. Below 
the coexistence curve is a coexistence region in which two equilibrium states 
with different concentrations of type 1 particles can coexist at the same 
temperature. The shaded region corresponds to metastable states. These are 
single-phase states which are not in thermodynamic equilibrium but are 
chemically stable. All single-phase states below the dashed line are unstable 
and cannot be realized in nature. Let us follow the horizontal line at 
temperature, T' < T c , in Fig. 3.36. At *1 = 0, we have a system consisting only 
of type 2 particles. As we start adding type 1 particles, the concentration of type 
1 particles increases until we reach the coexistence curve at point I. At this 
point, the system separates into two phases, one in which type 1 particles have 
concentration x{ and another in which type 1 particles have concentration x{ 1 . 
As we increase the number of type 1 particles relative to type 2 particles, the 


T 



Fig. 3.36. The phase diagram for a binary mixture. The point C is the critical point. 
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Fig. 3.37. The phase diagram for a mixture 
of rt-hexane and nitrobenzene (C 6 H 5 N0 2 ) 
at atmospheric pressure. The solid line is 
the coexistence curve. (Based on Ref. 2.) 


amount of phase II increases and the amount of phase I decreases until we reach 
the coexistence curve at point II. At point II, phase I has disappeared and we 
again have a single-phase equilibrium state of concentration, x{ ! . 

We see that there many analogies between the liquid-gas transition and the 
separation of a binary mixture into two phases. An example of a system 
exhibiting this type of behavior is a mixture of n-hexane and nitrobenzene at 
atmospheric pressure. The phase diagram for this system is given in Fig. 3.37. 


■ EXERCISE 3.6. Consider a binary mixture of particles of types 1 and 2, 
whose Gibbs free energy is given by 

G = T) + T ) + /?r«iln(xi) + /?7« 2 ln(.x 2 ) + \nx\X 2 , 

where n = n\ + « 2 ,«i and n 2 are the mole numbers, and x\ and X 2 are the 
mole fractions of particles of types 1 and 2, respectively, (a) Plot the molar 
Gibbs free energy g(P, T,x i) versus x\ for fixed P and T (for definiteness 
assume that $ = 1.0, fi 2 = 105 , RT = 0.8, and A = 1). What are x[ and 
.xf ? ( b ) Use conditions {d^i\/dx\) c PT =(cP- / dx\) c p T = 0 to locate the 
critical point, (c) Find the condition for equilibrium between the two 
phases. Show that the conditions = /i{ 7 and fi ! 2 = the condition 

( dg/dxiYpj = (dg/dx \) l PT are equivalent. 

Answer: 

(a) The molar Gibbs free energy is 

g = — = x \ //J + x 2 fi 2 -I- RTxi ln(*i ) + RTx 2 ln(* 2 ) + A*i* 2 , ( 1 ) 

A plot of g versus x\ is given below for = 1.0, fi 2 = 1.05, 
RT = 0.8, and A = 1. 



SPECIAL TOPICS: BINARY MIXTURES 


159 


The equilibrium concentrations are = 0.169 and xf = 0.873. 



(b) The chemical potential, /i 2 , is 

«=(J0 T =$(P,T)+RT ln(l-xi) + A*f. (2) 

Therefore, at the critical point we have {d^ 2 /dx\ ) C PT = 
-RT/( 1 — jci) + 2Ajci = 0 and (d 2 /^ /dx\) c PT = -RT/{\ -*i) 2 + 
2A = 0. If we solve these two equations for T and x \ , we find that 
the critical point is located at = 1/2 and T = A/2 R. 

(c) There are two ways to obtain the condition for equilibrium between 
the two phases, (i) The condition {dg/dx \) P j= {dg/dx i) PT , 
together with Eq. (1), yields the equilibrium condition 

A(1 - 2*') +*rin(j-^ = A(1 —2*'') + iWln(Y^)’ 


(ii) The conditions = /if and fi l 2 = /if yield the equilibrium 
conditions 

RT\n{x[) + \{\ -x[) 2 =RT ln(*f) + A(1 -*f) 2 (4) 


/?7Tn(l — x[) + A(xj) 2 = /?Tln(l — jcf) + A(xf ) 2 . (5) 

If we subtract Eq. (5) from Eq. (4), we recover Eq. (3). Thus, the two 
equilibrium conditions are equivalent. 
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^ S3.E.3. Coexistence Curve 

Before we write an equation for the coexistence curve, it is necessary to 
introduce a new expression for the chemical potential. Let us first remember 
that the Gibbs free energy can be written G = U - ST + PV = n\[i\ + n 2 fi 2 . 
Therefore, the chemical potential for type 1 particles can be written 


Ah 


fdG\ /dU\ T ( dS \ +p(—\ 

\d n \) p t , „ 2 \d n \) PJ,n 2 \d n l) P,T,n 2 \^ n l ) P,T,n 2 


(3.140) 


The quantity u\ = (dU /dn\) PT ni is a partial molar internal energy, s\ = 
{dS/dn\) PTtl2 is a partial molar’ entropy, and vi = (dV /dn\) PTn2 is a partial 
molar volume. Note that the chemical potential is a partial molar Gibbs free 
energy. Similar quantities exist for type 2 particles. In terms of these “partial” 
quantities, the chemical potentials can be written 


Ah = «i — s\T + v\P and = «2 — s 2 T + v 2 P. 


(3.141) 


It is useful to note the relation between the molar volume, v = V/n and the 
partial molar volume, vj = (dV / dn\) pTfl2 . It is easy to show that 


vi 



= v+ (1 -xi) 

P,T,n 2 



(3.142) 


The partial molar enthalpies of the type 1 and type 2 particles are 


hi = 



~ U\+ V\P 

P,T,n 2 


and 


h 2 = 


dH\ 

pn 2 ) PTni 


= u 2 + v 2 P, 


(3.143) 


respectively. 

If we are in a region of coexistence of the two different phases, then the 
temperature, pressure, and chemical potentials of each type of particle must be 
equal. As we move along the coexistence curve, changes in the chemical 
potentials, temperature, and pressure of the two phases must be equal. Let us 
now consider the differential of the quantity, p\/T. It is 



(3.144) 
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However, 


_d_ /ml = _!h_ 

pT v r / j p n . ni 7*’ 


(3.145) 


where we have used the fact that (d/21 /dT) Prii ni = — {dS/dn\) PTn and /21 = 
h\ — s\T. If we make use of the fact that (d/21 /ciP) Tni „ 2 = ( dV/6n\) p Tni = vi 
and (3.145), we can write Eq. (3.144) in the form 


d 



t2 t t 




and we can write 


d 



hi V 2 1 

— ~=rdT + —dP + — 

ji ' j t 



dx 1, 

P,T 


(3.146) 


(3.147) 


where vi and V2 are partial molar volumes. However, along the coexistence 
curve, dlm/T) 1 = d^i/T) 11 = d(m/T), dT l = dT n = dT, and dP 1 = dP 11 = 
dP, but dx\ 7^ dx". Therefore, we can write the following equations for the 
coexistence curves: 


~~dT + AvidP + dx[ ~ = 0 

i yCMj J p T J p t 


(3.148) 


and 


Ah 7 


dT + Av 2 dP + 


dx‘-( 9 -^\" dx'! = 


dxij 


P.T 


dx\) 


(3.149) 


P,T 


where A/i, — h[ — h" and Av, = v\ — vf for (i = 1,2). 

Let us now obtain an equation for the coexistence curve in the {T,x\) plane 
for constant pressure processes, dP = 0. The equations for the coexistence 
curve reduce to 


A/ii /d/iA 7 (dx\\ (djM\ n ( dx y\ _ 
T \dxi) PJ \dT ) p \dxJ PT \dT ) p ~~ 


(3.150) 


and 


A h 2 


+ 


/d/2 2 y (dx[\ (dyiV f dx"\ 
\dxJpp\dTjp \dxJ PJ \dTj 


= 0 . 


T 


p 


(3.151) 
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We can solve Eqs. (3.150) and (3.151) for ( dx[/dT) p and {dx ! ( /dT) p . If we use 
Eqs. (3.126) and (3.127), we obtain 


and 


fdx\\ _ JcfA/q + (1 — *f)A/i 2 
\ dT )p T{x I { — x\){d 2 g/dx\) I pr 


<£tf\ _ jcjA/ii + (1 — *i)A/i2 
dT ) p T(x[ l - x[){d 2 g/dx\) I PT 


(3.153) 


Thus, in general there is a latent heat associated with this phase transition and it 
is a first-order transition. 


► S3.F. The Ginzburg-Landau Theory of 
Superconductors [9, 26, 35] 

One of the most interesting applications of the Ginzburg-Landau theory 
involves a sample of superconducting material which may contain both normal 
and condensed regions or is of finite size. For such systems we must allow the 
order parameter to vary in position. Assume that a constant external magnetic 
field, H, is applied to the sample, but allow for the possibility of a spatially 
varying order parameter, ^(r). If the sample contains both normal and 
condensed regions, then there will also be a local spatially varying induction 
field, B(r). The order parameter is treated in a manner similar to the wave 
function of a quantum particle. Gradients in the order parameter (spatial 
variations) will give contributions to the free energy of a form similar to the 
kinetic energy of a quantum particle. Furthermore, if a local induction field B(r) 
is present, the canonical momentum of the condensed phase will have a 
contribution from the vector potential A(r) associated with B(r). The local 
vector potential A(r) is related to the local induction field, B(r), through the 
relation 


B(r) = V r x A(r). 

The Helmholtz free energy ^(r, B, T) per unit volume is given by 
a,( r, B, T) =a„(T) + a 2 (r)|®(r)| 2 + a 4 (r)|*(r)| 4 


+ 


B 2 


2m 


iW r ^ - + - — , 

2 / 4 ) 


(3.154) 


(3.155) 


where e and m are the charge and mass, respectively, of the electron pairs, and 
% is Planck’s constant. The quantity, (— ifiV r — eA), is the canonical 
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momentum operator associated with the condensate. It has exactly the same 
form as the canonical momentum of a charged quantum particle in a magnetic 
field. The Gibbs free energy per unit volume is given by a Legendre 
transformation, 


g(r, H, T) = fl(r, B, T) — B • H. (3.156) 

The total Gibbs free energy is found by integrating g( r, H, T) over the entire 
volume. Thus, 


* 

r (H,T)= dr 


1 


a n (T) + a 2 (T)\V(r)\ 2 +a 4 (T)\*(r)\ A 


B 2 


2m 2 no 


(3.157) 


If we now extremize G 5 (H, T) with respect to variations in \I>*(r) and assume 
that on the boundary of the sample the normal component of the canonical 
momentum, (ihV r ^ — eAty), is zero, we find 

a 2 V + 2a 4 y\V\ 2 +r!-(-iW r - <?A(r)) 2 ^ = 0. (3.158) 

2m 

To obtain Eq. (3.158) we have had to integrate by parts. If we next extremize 
G S (H, T) with respect to variations in A(r), we obtain 

— V r xB = +zA(^V r ^ - W r r) A|^| 2 . (3.159) 

fiQ 2mi m 

To obtain Eq. (3.159), we have used the vector identities 

| dVC • (V r x A) = <J>dS • (A x C) + j dV A • V r x C (3.160) 
and 

<J>dS • (E x F) = <j)E • (F x dS), (3.161) 

where J dV is an integral over the total volume V of the sample and j> dS is an 
integral over the surface enclosing the volume V. We have assumed that 
(B - /xoH) x n = 0, where n is the unit vector normal to the surface of the 
sample. Thus, right on the surface the tangent component of the induction 
field is the same as that of a normal metal. However, it will go to zero in the 
sample. 
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The local supercurrent in a superconductor is driven by the local magnetic 
induction field and is defined as 

J,(r) = — V r x B(r) = ~ (®*V^ - *V r **) - - A|4>| 2 (3.162) 

fiQ Ami m 

as we would expect. It is a combination of the current induced by H and the 
magnetization current. If we compare Eqs. (3.159) and (3.162) we see that the 
supercurrent J 5 (r) has the same functional form as the probability current of a 
free particle. Inside a sample, where B(r) = 0, there is no supercurrent. The 
supercurrent is confined to the surface of the sample. 

We can use Eqs. (3.158) and (3.159) to determine how the order parameter, 
'f'(r), varies in space at the edge of the sample. Let us assume that A(r) = 0, 
but that can vary in space in the z direction = \£(z). Under these 
conditions, we can assume that \P(z) is a real function, and therefore there is no 
supercurrent, J 5 = 0. Let us next introduce a dimensionless function, 


/(z)= * w v2r (3J63) 

Then Eq. (3.158) takes the form 

-« 2 (r)0-/+/ 3 = °, (3.164) 

where £(T) is called the Ginzburg-Landau coherence length and is defined as 


«T) = 



(3.165) 


Equation (3.164) can be used to find how the order parameter varies in space on 
the boundary of a superconducting sample. 

Let us consider a sample that is infinitely large in the x and y directions but 
extends only from z = 0 to z = oc in the z direction. The region from z — -' 00 
to z — 0 is empty. We will assume that at z = 0 there is no condensate, 
f(z = 0) = 0; but deep in the interior of the sample it takes its maximum value 
f (z = oo ) = 1 [that is, \P(z) = (|q;2|/2q!4 ) 1//2 deep in the interior of the samplej- 
Equation 3.164 is a nonlinear equation for/, but it can be solved. To solve it we 
must find its first integral. Let us multiply Eq. (3.164) by df /dz and rearrange 
and integrate terms. We then find 




=/ 2 - r r + C, 


'df 

dz j 


(3.166) 
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where C is an integration constant. We will choose C so that the boundary 
conditions 


dz Z-+OC 


0 


and 


/—> 
z-> 00 


1 


are satisfied. This gives C = — ( 1/2) and Eq. (3.166) takes the form 


df 


?\f t I =^o-/ 2 ) 2 - 


We can now solve Eq. (3.167) and find 


f{z ) = tanh 




(3.167) 


(3.168) 


[f(z) is plotted in Fig. 3.38]. Most of the variation of the order parameter occurs 
within a distance z = 2£(T) from the boundary. Thus, the order parameter is 
zero on the surface but increases to its maximum size within a distance 2£(7’) of 
the surface. Near a critical point, £(T) will become very large. 

It is also useful to introduce the penetration depth for the magnetic field and 
determine the manner in which the induction field B(r) dies away at the surface 
of the sample. Let us assume that the order parameter is constant throughout the 
superconductor (we neglect variations at the surface, although this is often a bad 
assumption). Let us also assume that the vector potential is pointed in the y 
direction and can vary in the z direction A = A y (z)y. Then Eq. (3.162) reduces 
to 


where 


A y (z) = A 2 


d 2 A y 
dz 2 ’ 


I 2ma4 
\(*2\e 2 iM) 


(3.169) 


(3.170) 


/(*) 


A 

L 



Fig. 3.38. A plot of f(z) = tanh(z/(\/2£)) 
versus z/(V 2£). 
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is the penetration depth. Thus the vector potential drops off exponentially inside 
the superconductor, 

= A 0 e~ z/ \ (3.171) 

and we obtain the result that all supercurrents are confined to within a distance 
A of the surface. Note that the penetration depth also becomes very large near 
the critical point. 

There are many more applications of the Ginzburg-Landau theory of 
superconductors than can be presented here. The interested reader should see 
Ref. 9. 
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PROBLEMS 

Problem 3.1. A condensible vapor has a molar entropy s = s 0 + R ln[C(v - b) 
(u+ (a/v)) 5 / 2 ], where C and so are constants, (a) Compute the equation of state, (b) 
Compute the molar heat capacities, c v and cp. (c) Compute the latent heat between 
liquid and vapor phases at temperature T in terms of the temperature T, the gas constant 
R, and the liquid and gas molar volumes v/ and v g . How can you find explicit values of v/ 
and v g if you need to? 

Problem 3.2. Find the coefficient of thermal expansion, a coex = (1 /v)(dv/dT) coex , for a 
gas maintained in equilibrium with its liquid phase. Find an approximate explicit 
expression for a coex , using the ideal gas equation of state. Discuss its behavior. 

Problem 3.3. Prove that the slope of the sublimation curve of a pure substance at the 
triple point must be greater than that of the vaporization curve at the triple point. 

Problem 3.4. Consider a monatomic fluid along its liquid-gas coexistence curve. 
Compute the rate of change of chemical potential along the coexistence curve, 
(dp/dT) coex , where p is the chemical potential and T is the temperature. Express your 
answer in terms of si , v\ and s g ,v g , which are the molar entropy and molar volume of the 
liquid and gas, respectively. 

Problem 3.5. A system in its solid phase has a Helmholtz free energy per mole, 
a s = B/Tv 3 , and in its liquid phase it has a Helmholtz free energy per mole, a/ = A /Tv 1 , 
where A and B are constants, v is the volume per mole, and T is the temperature, (a) 
Compute the molar Gibbs free energy density of the liquid and solid phases, (b) How are 
the molar volumes, v, of the liquid and solid related at the liquid-solid phase transition? 
(c) What is the slope of the coexistence curve in the P-T plane? 

Problem 3.6. Deduce the Maxwell construction using stability properties of the 
Helmholtz free energy rather than the Gibbs free energy. 

Problem 3.7. For a van der Waals gas, plot the isotherms in the P-V plane (P and V are 
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the reduced pressure and volume) for reduced temperatures T = 0.5, T = 1.0, and 
T — 1.5. For T = 0.5, is P = 0.1 the equilibrium pressure of the liquid-gas coexistence 
region? 

Problem 3.8. Consider a binary mixture composed of two types of particles, A and B. 
For this system the fundamental equation for the Gibbs free energy is G=n A ^ A + 
the combined first and second laws are dG = -SdT-\- V dP + ^ A dn A + figdriB (S is the 
total entropy and V is the total volume of the system), and the chemical potentials jj A 
and fin are intensive so that (i A = / jl a {P , T,x A ) and (jl b = Pb(P> T,x a ), where x A is the 
mole fraction of A. Use these facts to derive the relations 

sdT — vdP + ^ x a dfx a = 0 (a) 

a=A,B 


and 


Y. x a (dn a + s a dT - v a dP) = 0, (b) 

a=A,B 

where 5 = S/n, v = V/n,n = n A + n B ,s a = {dS/dn a ) PTn ^ a , andv a = (dV/ dn a ) PTnp/ 
with a = A,B and /3 — A,B. 

Problem 3.9. Consider liquid mixture (/ ) of particles A and B coexisting in equilibrium 
with vapor mixture (g) of particles A and B. Show that the generalization of the 
Clausius-Clapeyron equation for the coexistence curve between the liquid and vapor 
phases when the mole fraction of A in the liquid phase is held fixed is given by 

(dP\ = x 8 a (s 8 a -4) +x g B (s g B -4) 

*5M-vi)+*s (vS-vi)’ 

where s a = (dS/dn a ) PT and v a = (dV/dn a ) PTn ^ a with a = A,B and (3 — A,B . 
[Hint: Equation (b) of Problem (3.8) is useful.] 

Problem 3.10. A PVT system has a line of continuous phase transitions (a lambda line) 
separating two phases, I and II, of the system. The molar heat capacity cp and the 
thermal expansivity ap are different in the two phases. Compute the slope (dP/dT) coe x of 
the A line in terms of the temperature T, the molar volume v, A cp = Cp — c p , and 
A ap = otp — oil. 

Problem 3.11. Water has a latent heat of vaporization, Ah = 540cal/gr. One mole of 
steam is kept at its condensation point under pressure at T = 373 K. The temperature is 
then lowered to T = 336 K, keeping the volume fixed. What fraction of the steam 
condenses into water? (Treat the steam as an ideal gas and neglect the volume of the 
water.) 

Problem 3.12. A liquid crystal is composed of molecules which are elongated (and 
often have flat segments). It behaves like a liquid because the locations of the center of 
mass of the molecules have no long-range order. It behaves like a crystal because the 
orientation of the molecules does have long range order. The order parameter for a 
liquid crystal is given by the dyatic S = 77(nn - (1/3)1), where n is a unit vector (called 
the director) which gives the average direction of alignment of the molecules. The free 
energy of the liquid crystal can be written 

(f> = 4>o + \ ASjjSij — -jBSijSjkSki + \ C SijSijSktSki 
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where A = Aq(T — V), Ao,B and C are constants, I is the unit tensor so 
Xi • I • xj = <5y, Sij = x t • S • xj, and the summation is over repeated indices. The 
quantities X| are the unit vectors Xi = x, x 2 = y, and X 3 = z. (a) Perform the summations 
in the expression for <f> and write <f> in terms of 77 , A,£,C. (b) Compute the critical 
temperature T c at which the transition from isotropic liquid to liquid crystal takes place, 
and compute the magnitude of the order parameter 77 at the critical temperature, (c) 
Compute the difference in entropy between the isotropic liquid (77 = 0) and the liquid 
crystal at the critical temperature. 

Problem 3.13. The equation of state of a gas is given by the Berthelot equation 
(P + a/Tv 2 )(v - b) = RT. (a) Find values of the critical temperature T c , the critical 
molar volume v c , and the critical pressure P c , in terms of a, b, and R. (b) Does the 
Berthelot equation satisfy the law of corresponding states? (c) Find the critical 
exponents /?, 6, and 7 from the Berthelot equation. 

Problem S3.1. A boy blows a soap bubble of radius R which floats in the air a few 
moments before breaking. What is the difference in pressure between the air inside the 
bubble and the air outside the bubble when (a) R = 1 cm and (b) R = 1 mm? The surface 
tension of the soap solution is a — 25 dynes/cm. (Note that soap bubbles have two 
surfaces. 

Problem S3.2. Imagine a droplet of water, in equilibrium with its vapor, placed on a 
wire frame that can stretch the surface area of the droplet, keeping the temperature fixed. 
Assume that the whole system, droplet, frame, and vapor is contained in a fixed total 
volume V and kept at temperature T. The grand potential of the system can be written 
ft — fl 0 + fl s , where f l s = a A is the surface grand potential, a is the surface tension, A 
is the surface area of the liquid, and is the grand potential of the remainder of the 
system. Neglect contributions from the frame and changes in the volume of the water 
droplet as it is stretched. The surface tension of water can be written 



where op = 75.5 dynes/cm is the surface tension at temperature, t = 0 °C, n = 1.2, and 
t' = 368 °C. (a) Compute the internal energy per unit area of the surface assuming that 
the number of surface atoms, N s = 0. (b) Plot the surface area and the surface internal 
energy per unit area for the temperature interval t = 0 °C to t = t 1 . 

Problem S3.3. Assume that two vessels of liquid He 4 , connected by a very narrow 
capillary, are maintained at constant temperature; that is, vessel A is held at temperature 
Ta, and vessel B is held at temperature Tg. If an amount of mass, AM, is transferred 
reversibly from vessel A to vessel B, how much heat must flow out of (into) each vessel? 
Assume that T A > T B . 

Problem S3.4. Prove that at the tricritical point, the slope of the line of first-order phase 
transitions is equal to that of the line of continuous phase transitions. 

Problem S3.5. For a binary mixture of particles of type 1 and 2 , the Gibbs free energy is 
= «iA*i + n 2 p 2 and differential changes in the Gibbs free energy are dG = -SdT+ 
V dP + jj, x dn\ + fi 2 dn 2 . The Gibbs free energy of the mixture is assumed to be 

G = ni^(P, T) + n 2 /j, 2 (P, T) + RTn\ In (*i) + RTn 2 In (x 2 ) + \nx\x 2 , 
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where /i° = /x° are the chemical potentials of the pure substances. In the region in which 
the binary mixture separates into two phases, / and II with concentrations x[ and x‘(, 
find the equation, (d x[/dT) p for the coexistence curve. Write your answer in terms of 
and r = T/T c where T c = A/2 R. 

Problem S3.6. Consider a mixture of molecules of type A and B to which a small 
amount of type C molecules is added. Assume that the Gibbs free energy of the resulting 
tertiary system is given by 

G(P, T, nA,ns,ric) =n A /j, A + + ncn° c + RTn A In (x A ) + RTns In (xb) 

+ RTn c In (x c ) + A n A n B /n + \ x n A n c /n + Ai n B n c /n, 

where n = n A + n B + nc,n c < n A , and n c < n B . The quantities &=&{P,T), 
H° B = h° b (P,T), and fx° c = /jP c (P,T) are the chemical potentials of pure A, B, and C, 
respectively, at pressure P and temperature T. For simplicity, assume that 
^ 0 =^ 0 = To lowest order in the mole fraction xc, compute the shift in the 
critical temperature and critical mole fraction of A due to the presence of C. 
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ELEMENTARY PROBABILITY 
THEORY AND LIMIT THEOREMS 


4.A. INTRODUCTION 

Thermodynamics is a theory which relates the average values of physical 
quantities, such as energy, magnetization, particle number, and so on, to one 
another. However, it tells us nothing about processes occurring at the 
microscopic level or how details of microscopic behavior affect thermodynamic 
behavior. The microscopic theory of systems with a large number of degrees of 
freedom is based on probabilistic concepts. In this chapter we will review some 
of the elementary concepts from probability theory that will occur again and 
again throughout the remainder of the book. 

We will begin in a rather simple way by introducing ways of counting and 
classifying large numbers of objects, and we will give an intuitive definition of 
probability which will later be justified by the law of large numbers. 

Stochastic variables are, by definition, variables whose values are deter- 
mined by the outcome of experiments. The most we can know about a 
stochastic variable is the probability that a particular value of it will be 
realized in an experiment. We shall introduce the concept of probability 
distributions for both discrete and continuous stochastic variables and 
show how to obtain the moments of a stochastic variable by using chara- 
cteristic functions. As examples we shall discuss in some detail the bino- 
mial distribution and its two limiting cases, the Gaussian and Poisson 
distributions. 

The simple concepts of this chapter can be carried a long way. In the section 
on special topics we will use them to study random walks on simple lattices in 
one, two, and three spatial dimensions. With these simple examples we can 
explore the effects of spatial dimension on a stochastic process. 

We have seen in Chapter 3 that there is a high degree of universality in the 
thermodynamic behaviour of systems with many degrees of freedom. In 
particular, we found that many systems have the same critical exponents, 
regardless of the microscopic structure of the system. This is an example of a 
Limit Theorem [1] involving highly correlated stochastic quantities. In this 
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chapter we will introduce Limit Theorems for uncorrelated stochastic 
quantities. We first introduce the concept of an infinitely divisible stochastic 
variable and then derive the Central Limit Theorem , which describes the 
limiting behavior of independent stochastic variables with finite variance. 
Finally, we study the limiting behavior of independent stochastic variables with 
infinite variance which are governed by Levy distributions. We use the 
Weierstrass random walk to show that such systems can develop fractal-like 
clustering in space. 


4.B. PERMUTATIONS AND COMBINATIONS [2, 3] 

When applying probability theory to real situations, we are often faced with 
counting problems which are complex. On such occasions it is useful to keep in 
mind two very important principles: 

(a) Addition principle : If two operations are mutually exclusive and the 
first can be done in m ways while the second can be done in n ways, 
then one or the other can be done in m + n ways. 

(b) Multiplication principle : If an operation can be performed in n ways, 
and after it is performed in any one of these ways a second operation is 
performed which can be performed in any one of m ways, then the two 
operations can be performed in nxm ways. 

These two principles will underlie much of our discussion in the remainder of 
this chapter. 

When dealing with large numbers of objects it is often necessary to find the 
number of permutations and/or combinations of the objects. A permutation is 
any arrangement of a set of N distinct objects in a definite order. The number of 
different permutations of N distinct objects is N\ The number of different 
permutations of N objects taken R at a time is (N\/(N — /?)!). 

Proof : Let us assume that we have N ordered spaces and N distinct objects with which 
to fill them. Then the first space can be filled N ways, and after it is filled, the second can 
be filled in (N — 1 ) ways, etc. Thus the N spaces can be filled in 


N(N- \)(N - 2) x ••• x 1 = N\ 


ways. To find the number of permutations, P%, of N distinct objects taken R at a time, let 
us assume we have R ordered spaces to fill. Then the first can be filled in N ways, the 
second in (N — 1) ways,. . ., and the /?th in (N - R + 1) ways. The total number of ways 
that R ordered spaces can be filled using N distinct objects is 

AH 

P» = N(N- 1) x • • • x (N - R + 1) = - 
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A combination is a selection of N distinct objects without regard to order. The 
number of different combinations of N objects taken R at a time is 
(N\/(N-R)\R\). 

Proof: R distinct objects have R\ permutations. If we let C R denote the number of 
combinations of N distinct objects taken R at a time, then R\C r = P R and 
C n r =N\/(N -R)!R! 

The number of permutations of a set of N objects which contains n\ identical 
elements of one kind, «2 identical elements of another kind,. . ., and n* identical 
elements of a kth kind is N\/n\ !« 2 ! • • • «*!, where «i + «2 H 1- w* = N. 


I ■ EXERCISE 4.1. Find the number of permutations of the letters in the 
i word, ENGINEERING. In how many ways are three E’s together? In how 
many ways are (only) two E’s together. 

i 

I Answer: The number of permutations is (11 !/3!3!2!2!) = 277,200, since 
there are 1 1 letters but two identical pairs (I and G) and two identical triplets 
| (E and N). The number of permutations with three E’s together = the 
number of permutations of ENGINRING = (9!/3!2!2!) = 15, 120. The 
j number of ways that only two E’s are together = 8 x (15, 120) = 120,960, 

! since there are eight ways to insert EE into ENGINRING and its 
' permutations. 


4.C. DEFINITION OF PROBABILITY [2-4] 

Probability is a quantization of our expectation of the outcome of an event or 
experiment. Suppose that one possible outcome of an experiment is A. Then, the 
probability of A occurring is P(A) if, out of N identical experiments, we expect 
that NP(A ) will result in the outcome A. As N becomes very large ( N — > oo) we 
expect that the fraction of experiments which result in A will approach P(A). An 
important special case is one in which an experiment can result in any of n 
different equally likely outcomes. If exactly m of these outcomes corresponds to 
event A, then jP(A) = m/n. 

The concept of a sample space is often useful for obtaining relations between 
probabilities and for analyzing experiments. A sample space of an experiment 
is a set, S, of elements such that any outcome of the experiment corresponds to 
one or more elements of the set. An event is a subset of a sample space S of an 
experiment. The probability of an event A can be found by using the following 
procedure: 

(a) Set up a sample space S of all possible outcomes. 

(b) Assign probabilities to the elements of the sample space (the sample 
points). For the special case of a sample space of N equally likely 
outcomes, assign a probability l/N to each point. 
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Fig. 4.1. (a) The shaded area is the union of A and B,AUB; (b) the shaded area is the 
intersection of A and B, A D B\ (c) when A and B are mutually exclusive there is no 
overlap. 


(c) To obtain the probability of an event A, add the probabilities assigned to 
elements of the subset of S that corresponds to A. 

In working with probabilities, some ideas from set theory are useful. The 
union of two events A and B is denoted A U B. A U B is the set of all points 
belonging to A or B or both (cf. Fig. 4.1a). The intersection of two events is 
denoted A D B. A D B is the set of all points belonging to both A and B (cf. 
Fig. 4.1b). If the events A and B are mutually exclusive, then AC\B = 0 where 
0 is the empty set (Afl5= contains no points) (cf. Fig. 4.1c). 

We can obtain some useful relations between the probabilities of different 
events. We shall let P{A) denote the probability that event A is the outcome of 
an experiment (P(0) = 0,P(S) = 1); we shall let P(Ar\B) denote the 
probability that both events A and B occur as the result of an experiment; 
and finally we shall let P(A U B) denote the probability that event A or event B 
or both occur as the outcome of an experiment. Then the probability P(A U B ) 
may be written 

P(A U B) = P(A) + P(B) - P{A (IB). (4.1) 

In writing P(A) + P(B), we take the region A fl B into account twice. 
Therefore, we have to subtract a factor P(A fT B). 

If the two events A and B are mutually exclusive, then they have no points in 
common and 

P(AUB)=F(A)+P(B). (4.2) 

If events Aj, A 2 , . . . , A m are mutually exclusive and exhaustive, then 
A\ U A 2 U • • • U A m = S and the m events form a partition of the sample space 
S into m subsets. If Ai,A 2 , . . . ,A m form a partition, then 

P(Ai) + P(Aj) + •••-(- P(A m ) = 1. 

We shall see Eq. (4.3) often in this book. 


(4.3) 
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The events A and B are independent if and only if 

P{Af\B)=P{A)P{B). (4.4) 


Note that since P{A fl B) ^ 0, A and B have some points in common. Therefore, 
independent events are not mutually exclusive events. They are completely 
different concepts. For mutually exclusive events, P(A D B) = 0. 

The conditional probability P{B\A) gives us the probability that event A 
occurs as the result of an experiment if B also occurs. P(B\A) is defined by the 
equation 


P(B\A) = 


P(AHB) 

m 


(4.5) 


Since P(A n B) = P(B D A), we find also that 

P(A)P(A|B) = P(B)P(B\A). (4.6) 


From Eq. (4.4) we see that if A and B are independent, then 

P(B\A) = P(A). (4.7) 

The conditional probability P(B|A) is essentially the probability of event A if 
we use the set B as the sample space rather than S. 


| ■ EXERCISE 4.2. Consider a sample space consisting of events A 
j and B such that P(A) = | , P{B) = |, and P(A\JB) = \. Compute 
j P(A fl J5),P(J5|A), and P(A|P). Are A and B independent? 

I 

| Answer: From Eq. (4.1), P{A fl B) = P(A) + P(B) — P(A U B) = ^. But 
P(A fl5)^ P(A)P(B) so A and B are not independent. The conditional 
| probabilities are P(A\B) = P{A HB)/P(A) = | and P(B\A) = P{AC\B)/ 
P{B) = |. Thus, | of the points in A also belong to B and | of the points in B 
I also belong to A. 


4.D. STOCHASTIC VARIABLES AND PROBABILITY 

In order to apply probability theory to the real world, we must introduce the 
concept of a stochastic, or random, variable (the two words are interchangeable, 
but we shall refer to them as stochastic variables). A quantity whose value is a 
number determined by the outcome of an experiment is called a stochastic 
variable. A stochastic variable, X, on a sample space, S, is a function which 
maps elements of S into the set of real numbers, {R}, in such a way that the 
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inverse mapping of every interval in {R} corresponds to an event in S (in other 
words, a stochastic variable is a function which assigns a real number to each 
sample point). It is useful to note that the statement select at random means that 
all selections are equally probable. 

In a given experiment, a stochastic variable may have any one of a number of 
values. Therefore, one must be careful to distinguish a stochastic variable 
(usually denoted by a capital letter X) from its possible realizations, {*,}. Some 
examples of stochastic variables are (i) the number of heads which appear 
each time three coins are tossed; (ii) the velocity of each molecule leaving a 
furnace. 

In this section we introduce the concept of a probability distribution 
function , a probability density function, and a characteristic function which can 
be associated with a stochastic variable and which contain all possible 
information about the stochastic variable. We also generalize these quantities to 
the case when several stochastic variables are necessary to fully describe a 
system. 

4.D.I. Distribution Functions 

4. D. 1.1a. Discrete Stochastic Variables 

Let X be a stochastic variable on the sample space, S, and assume that X has a 
countable set of realizations, *,•, where i = 1 , 2 , ...,«(« is either a finite integer 
or n = oo). One can make S a probability space by assigning a probability, p t , to 
each realization, The set of values, {/?,}, is the probability distribution on S 
and must satisfy the conditions and Y^iPi = 1* 

We can introduce a probability density function, P*(x), defined as 

P *( X ) = ^2Pi S ( x ~ *«')> ( 4 - 8 ) 

i=l 

and a distribution function, Fx{x), defined as 

F x (x) = f dyP x {y) = ^p;0(x-*,). (4-9) 

J-oo ,=i 

where ©(* — jc,-) is a Heaviside function and has values ©(*) = 0 for x < 0 
and ©(*) = 1 for x > 0. Note that the probability density function is just 
the derivative of the distribution function, Px{x ) = dFx{x)/dx. The distri- 
bution function, F x (x), is the probability that the stochastic variable, X, has 
a realization in the interval (-oo,*). In order that the probability density, 
P x (x), always be positive, the distribution function, F x {x), must be 
monotonically increasing function of x. It has limiting values Fx(-oc) = 0 
and Fx(+oo) = 1. 
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■ EXERCISE 4.3. Consider a weighted six-sided die. Let x, = i 
(i = 1,2, ... ,6) denote the realization that the i th side faces up when the 
die is thrown. Assume that p { = ± ,p 2 = \ ,p 3 = g ,P 4 = g ,P 5 = ^ ,P6 = n » 
in Eqs. (4.8) and (4.9). Plot Px(x) and F x (x) for this system. 


Answer: 



height equal to />,. 


4.D.l.lb. Continuous Stochastic Variables 

Let X be a stochastic variable which can take on a continuous set of values, such 
as an interval on the real axis. From the definition of a stochastic variable, we 
know that an interval {a^x^b}, corresponds to an event. Let us assume that 
there exists a piecewise continuous function, PxC*)> such that the probability 
that X has a value in the interval {a^x^b} is given by the area under the 
curve, P x (x) versus x, between x = a and x = b. 


Prob (a^x^b) 


% b 

dxP x (x). 


Ja 


(4.10) 


Then X is a continuous stochastic variable, P x {x) is the probability density for 
the stochastic variable, X, and P x (x)dx is the probability to find the stochastic 
variable, X, in the interval x — ► x + dx. The probability density must satisfy the 
conditions P x (x)^0 and = 1 (we have assumed that the range of 

X is — oo^jc^oo). 

We can also define the distribution function, F x (x), for the continuous 
stochastic variable, X. As before, it is given by 


Fx(x) = 


dyPx(y ), 

J —00 


(4.11) 


a nd it is the probability to find the stochastic variable, X, in the interval 
{—oo^x}. The distribution function, Fm(x), must be a monotonically 
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increasing function of x, and it has limiting values F x (— oo) = 0 and 
F x {- l-oo) = 1. 


■ EXERCISE 4.4. Plot the probability density, Px(x), and the distri- 
bution function, F x (x), for the Gaussian probability density, Px(x) ~ 
(l/2v^)^ /8 . 


Answer: 



Often we wish to find the probability density, not for the stochastic variable, 
X , but for some new stochastic variable, Y = H(X), where H(X) is a known 
function of X. The probability density, Py(y), for the stochastic variable, Y, is 
defined as 

( OO 

dx6(y-H(x))P x (x), (4.12) 

— OO 

where S(y — H(x)) is the Dirac delta function. 


4.D.2. Moments 

If we can determine the probability density, P x {x), for the stochastic variable, 
X, then we have obtained all possible information about it. In practice, this 
usually is difficult. However, if we cannot determine P x (x), we can often obtain 
information about the moments of X. The nth moment of X is defined 



dxx n Px(x). 


(4.13) 


Some of the moments have special names. The moment, (x), is called the mean 
value of X. The combination, (x 2 ) — (x) 2 , is called the variance of X, and the 
standard derivation of X, a x , is defined as 


a x = <J(F) - <*> 2 


(4.14) 
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The moments give us information about the spread and shape of the probability 
density, Px(x). The most important moments are the lower-order ones since 
they contain information about the overall behavior of the probability density. 
We give some examples below. 

4.D.2.1a. The First Moment 

The first moment, (x), gives the position of the “center of mass” of the 
probability density, Px(x). It is sometimes confused with two other quantities, 
the most probable value, x p , and the median, x m . The most probable value, x p , 
locates the point of largest probability in Px{x). The median, x m , is the value of 
x which divides the area under the curve, Px(*) versus x, into equal parts. In 
other words, F x (x m ) = 

In Exercise (4.4), because of the symmetric shape of the Gaussian 
distributions shown there, the mean, (x), the most probable value, x p , and the 
median, x m , all occur at the same point, x = 0. In Exercise 4.5, we give an 
example in which they all occur at different points. 


■ EXERCISE 4.5. Locate (x), x p , and x m for the probability density, 


P x (x), and the probability distribution, F x (x), shown below. 

1.0 

i p x(x) 

r 

1.0 

Fx{x) 

/ \ 0.6 

pw] 

0.6 


°.2 

— / i - ':-t j • A 

j 

/ 0.2 

' 

- 1.5 - 1.0 - 0.5 C 

0.5 1.0 X 

- 1.5 - 1.0 - 0.5 C 

0.5 1.0 1.5 X 


Answer: The mean is (x) = —0.5625. The most probable value is 
I- x p = —1.0. The median is x m = -0.862. 


4.D.2.1b. The Second Moment 

The second moment, (x 2 ), gives the “moment of inertia” of the probability 
about the origin. The standard deviation, cr x [cf. Eq. 4.14] gives a measure of 
how far the probability spreads away from the mean, (x). It is interesting to 
consider some examples. In Exercise (4.4), (x 2 ) = 4, (x) = 0, and a x = 2. In 
Exercise 4.5, (x 2 ) = 0.966, (x) = -0.5625, and cr* = 0.806. 

4, D. 2.1c. The Third Moment 

The third moment, (x 3 ), picks up any skewness in the distribution of probability 
about x = 0. This can be seen in Exercise 4.4 and 4.5. For Exercise (4.4), 
(x 3 ) = 0 because the probability is symmetrically distributed about x = 0. 
However, for the probability distribution in Exercise 4.5, (x 3 ) = —0.844, 
indicating that most of the probability lies to the left of x = 0. 
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4.D.3. Characteristic Functions 

The characteristic function, f x (k), corresponding to the stochastic variable, X, is 
defined as 

fx(k) = (e**) = f” dxe ik *P x (x) = jrMM. (4.15) 

J-oo „=o n. 

The series expansion in Eq. (4.15) is meaningful only if the higher moments, 
(*”), are small so that the series converges. From the series expansion in Eq. 
(4.15) we see that it requires all the moments to completely determine the 
probability density, P x (x). Characteristic functions are continuous functions of 
k and have the property that f x (0) = 1 , | f x {k)\ ^ 1 , and fx(-k) =f£{k ) (* 
denotes complex conjugation). The product of two characteristic functions is 
always a characteristic function. 

If the characteristic function is known, then the probability density, P x (x), is 
given by the inverse transform 

1 f 00 

Px{x)=— dke~ lhc f x (k). (4.16) 

^ J— 00 

Furthermore, if we know the characteristic function we can obtain moments by 
differentiating: 

(4.17) 

Equation (4.17) provides a simple way to obtain moments if we know /*(&). 

It is often useful to write the characteristic function, f x {k), in terms of 
cumulants, C n (X), rather than expand it directly in terms of moments. The 
cumulant expansion is defined 

(4.18) 

where C„(X) is the « th -order cumulant. If we expand Eqs. (4.15) and (4.18) in 
powers of k and equate terms of the same order in k, we find the following 
expressions for the first four cumulants: 


C,(X) = (x), 

(4.19) 

C 2 (X) = ( x 2 ) - (x) 2 , 

(4.20) 

C 3 (X) = (x 3 ) - 1{x){J) + 2(x) 3 , 

(4.21) 



and 


C 4 (X) = {. x 4 ) - 3(^> 2 - 4(x){x*) + 12<*> V) - 6(x)\ 


(4.22) 
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If higher-order cumulants rapidly go to zero, we can often obtain a good 
approximation to fx(k) by retaining only the first few cumulants in Eq. (4.18). 
We see that C\{X) is just the mean value of X and C 2 (X) is the variance. 


■ EXERCISE 4.6. Consider a system with stochastic variable, X , which 
has proba bility density, Px(x), given by the circular distribution; Px(x) = 
— x 2 for |;t| <1, and Px{x) = 0 for \x\ > 1. Find the characteristic 
function and use it to find the first four moments and the first four cumulants. 

Answer: The characteristic function is 


fx(k)=- dxe lKx y/\ - x 2 = yJi(k), 

7T t _i K 

(cf. Gradshteyn and Ryzhik [8]) where Ji(k) is a Bessel function. Now 
expand fx(k) in powers of k 


^ /fN 2 \k k 3 k 5 

^^"1 2 16 + 384 + 


_ 1 \_k^_ 

42! + 84!~ 


From Eq. (4.17), the moments are (x) = (x 3 ) — 0, (x 2 ) = J, and (x 4 ) = ±. 
The cumulents are Cj = C 3 = 0, C 2 = J, and C 4 = 


4.D.4. Jointly Distributed Stochastic Variables 

The stochastic variables, X\,X 2 ,-..,X n , are jointly distributed if they are 
defined on the same sample space, S. The joint distribution function for the 
stochastic variables, X\,X 2 , . . . ,X n , can be written 

Fx u ...ji n {x\,. ..,x n ) = Prob{Xi < x u . . . ,X n < *„}, (4.23) 

where {Xi < * 1 , . . . ,X„ < *„} = {Xi < * 1 } fl • • • fl {X n < x n }. In other words, 
(* 1 , • • • ,*n) is the probability that simultaneously the stochastic 
variables, X it have values in the intervals {-00 < X t < *,} for i = 1, . . . ,n. 
The joint probability density, ...,*„) is defined as 


(* 1 1 • • • ? x n ) — 


d n Fx i„..^ n (*i, • ••,*») 

dx\ • • • dx n 


(4.24) 


so that 


Fx\ t ...jc n {x\ 1 • • • ?*n) f ■■■[ dx\ ' ’ ‘dXnPx\, ...^(Xl, • • • yX n ). (4.25) 
J —OC j — OO 

For simplicity, let us now consider the joint distribution function, F X y(x,y), 
and joint probability density, Pxjix^y), for two stochastic variables, X and Y. 
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The distribution function satisfies the conditions Fx,y{-oc,y) = 
Fx,y(x, -oo) = F x ,y{- oo, -oo) = 0 and F x ,y{ oo,oo) = 1. Furthermore, 

FxA x ^y) - F x,y{ x ^y) =Pj‘ob{xi < for x 2 >x\. 

(4.26) 

The probability density, Px,y{x,y), satisfies the condition 0 ^Pxy(x, y) ^ 1, and 
is normalized to one: 


dx 


dyP x ,y(x,y) = 1. 


(4.27) 


If we want the reduced distribution function, F x (x), for the stochastic variable, 
X, it is defined as 


F x (x) = F x ,y(x, oo) = 



*0O 

dyP X y{x\y). 

— OO 


(4.28) 


Similarly, the reduced probability density, Px{x), for the stochastic variable, X , 
is defined as 


Px(x) = 


•OO 

dyP x ,y(x,y). 

— OO 


(4.29) 


We can obtain the reduced distribution function, F Y (y), and the reduced 
probability density, P Y (y), in a similar manner. 

The nth moment of the stochastic variable, X, is defined as 


<y> 


f OO 



dyx n P x ,y(x,y). 


J — OO 


J — OO 


(4.30) 


Joint moments of the stochastic variables, X and Y, are defined as 




•oo roo 

dx dyx m y n P x ,y(x,y). 

— oo -oo 


(4.31) 


There are two related joint moments that are commonly used in the physics 
literature. They are the covariance, 

Cov (. X , Y) = ((x - (x))(y - (y))) = (xy) - (x)(y), (4.32) 


and even more commonly the correlation function 


Cor (. X , Y) 


((x- (x))(y~ (y))) 


OxOy 


(4.33) 
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where ax and ay are the standard deviations of the stochastic variables X and Y, 
respectively. The correlation function, Cor(X, y), is dimensionless and is a 
measure of the degree of dependence of the stochastic variables X and Y on one 
another (cf. Exercise 4.7). The correlation function has the following properties: 

(i) Cor(X,y) =Cor(T,X). 

(ii) -l^Cor(X,y)^l. 

(iii) Cor (X, X) = l,Cor(X,-X) = -1. 

(iv) Cor ( aX + b,cY + d) = Cor (X, y) if a,c ^ 0. 

The notion of joint correlation function can be extended to any number of 
stochastic variables. 

For two stochastic variables, X and Y, which are independent , the following 
properties hold: 

(i') Px,r( x >y ) = PxU)Pr(y)- 
(ii') (XY) = (X)(Y). 

(iii') <(X + Y) 2 ) - (( X + Y)) 2 = (X 1 ) - (X) 1 + (Y 2 ) - (Y) 2 . 

(iv') Cor(X,y) = 0. 


Note that the converse of (iv') does not necessarily hold. If Cor (X, Y) = 0, it 
does not always means that X and Y are independent. 

■ EXERCISE 4.7. Show that the correlation function, Cor(X, Y), is a 
measure of the degree to which X depends on Y. 

Answer: Let X = aY + b and choose a and b to minimize the mean square 
( error, e = ((jc — ay — b ) 2 ). Set be = (de/da)6a + ( be/bb)bb = 0, and set 
! the coefficients of 6a and 6b separately to zero. This gives two equations, 
-2 (xy) + 2 a(y 2 ) + 2b (y) = 0 and -2(x) + 2 a(y) + 2b = 0. Eliminate b and 
solve for a to find a = —Cor (X, Y)ax/ay. Thus when Cor (X, Y) = 0, a = 0 
and the random variables, X and Y, appear to be independent, at least when 
| vie wed by this statistical measure. 

When we deal with several stochastic variables, we often wish to find the 
probability density for a new stochastic variable which is a function of the old 
stochastic variables. For example, if we know the joint probability density, 
P xj(x,y), we may wish to find the probability density for a variable 
Z = G(X, y), where G(X, y) is a known function of X and Y. The probability 
density, Pz(z), for the stochastic variable, Z, is defined as 



dy6(z-G(x,y))P x , Y (x,y). 


(4.34) 
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From Eq. (4.34), the characteristic function for the stochastic variable, Z, is 
easily found to be 


fz(k) = 


dx 


dye ,kG ^P x , Y (x,y). 


(4.35) 




■ EXERCISE 4.8. The stochastic variables, X and Y, are independent and 
are Gaussian distributed with first moments, (x) = (y) = 0, and standard 
deviations, o x = <*y — 1- Find the joint distribution function for the 
stochastic variables, V = X + Y and W = X - Y. Are Vand W independent? 


Answer: The probability densities for X and Tare P x (x ) = (l/v^)^ 1 / 2 )* 2 
and Py(y) = (1 / y/2n)e~^^ 2 ' )y2 . Since X and Y are independent, Eq. (4.34) 
gives 


Pv,w(v,w) = 


2i r 


dx 


dy6(v — v'(x,y))<5(w — w'(x,y))e 


0) 


where v'(*,y) = x-\-y and w'(x,y) = x — y. Now note that 

6(v - v'(x,y))6(w - w'^y)) =j( X y W - *'(v, w))6(y - y'(v, w)), 

\w v/ 

( 2 ) 

where ^(v, w) = \ (v + w), /(v, w) = \ (v — w), and j( X ^ = \ is the 
Jacobian of the coordinate transformation. Thus, w v 


P v ,wiy i w) 


_L e -( 1 / 4 )(v 2 +‘ | ' 2 ) 

4tt 


V and W are independent since Py,w(v, w) is factorizable. 


( 3 ) 


We can also introduce characteristic functions for jointly distributed 
stochastic variables. They are often useful for obtaining moments. We define 


dx^e 


i{k\ x \+-+k N x N ) 


poo 

fx u ...,x N (k\, . . . ,k N ) = \ dx 

J—OO 

x Px u ...,X N (x 1 , . • ■ ,Xn). 

The joint moment, (x f - • • • x n ) (n^N), is then given by 

d d 

<*' ■■ -x n ) = Km • ■ • Um (-/)”— x • - x —f x , tv). 


(4.36) 


(4.37) 
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One of the most important multivariant distributions, and one which we will use 
repeatedly in this book, is the multivariant Gaussian distribution with zero 
mean. Some of its properties are discussed in Exercise (4.9). 


■ EXERCISE 4.9. The multivariant Gaussian distribution with zero mean 
can be written 


Px u ...J{ N (xi, • • ■ ,x N ) — 


/det(g) _ ( 1 / 2 )x r.g. x 


where g is a symmetric N x N positive definite matrix, x is a column vector, 
and the transpose of x, \ T = (xi, . . . ,xn), is a row vector. Thus, 
x' f • g • x = 1 E/ii SijXiXj. (a) Show that Px u ...* N (x \, . • • ,x N ) is normal- 
ized to one. (b) Compute the characteristic function, /*, • ■ • , &w). (c) 

Show that the moments, (x,) = 0 and that all higher moments can be 
expressed in terms of products of the moments, (xf) and (*,*,). This is the 
simplest form of Wick’s theorem used in field theory. 


Answer: 


(a) Since g is a symmetric matrix, it can be diagonalized using an 
orthogonal matrix, O, where O * = O 7 and O 1 • O = I (I is the unit 
matrix). Note that the determinant of O is one, det(O) = 1. We can 
write O • g • O 7 = f , where f is diagonal (T,, = 7 , and Ty = 0 for 
i ^ j). The orthogonal matrix, O, transforms the stochastic variables. 
We will let a = O • x = (ai, ... ,a N ). Since det (O) = 1, the 
Jacobian of the transformation is one. Thus, dx\-- dx n = 
da\ • • • da N . Furthermore, x 7 • g • x = <x T • f • • • a. Thus, 



since det(g) = det(r) = 71 • • ■ 7„. 
(b) The characteristic function is 
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form, as we did in part (a), it is straightforward to show that 

(c) The first moment is (*,■) = \im ki ^ 0 (-i)(d/dki)fx u ...,x N (*i, . . . = 

0, since the derivative brings down a factor of kj. In fact, all odd 
moments will be zero for the same reason. The second moment is 
easily seen to be (xfij) = (g _1 )i>- Inspection shows that all higher 
moments depend only on sums of products of factors of the form 
(g and therefore only on sums of products of second moments. 
More generally, the average of a product of2n stochastic variables is 
equal to the sum of all possible combinations of different pairwise 
averages of the stochastic variables. For example, 

<*1* 2 *3* 4 ) = (*1*2) (X3X4) + (*1*3) (*2*4) + (*1*4) (*2*3) • 

(Note that (xfij) = (xp c,).) More generally, if we have In stochastic 
variables, the number of terms in the expression for the 2nth moment, 

(* 1*2 • • ‘ x 2n) = (X\X 2 )(x 3 X 4 ) X • • • X {x 2n -\X 2 n) + ■ * ■ , 

is (2n)!/n!2 n . We determine this as follows. There are 
((2n)(2n — l)/2) ways to form the first pair. After that there are 
((2 n - 2) (2 n — 3)/2) ways to form the second pair, and so on. After 
all the pairs are formed in this way, by the multiplication rule, there 
will be YYi=o ((2n — 2i)(2n — (2 i + 1 ))/2) = ((2n)!/2 n ) different 
combinations of pairs. Of these, n\ will be identical because they 
are different permutations of the same pairs. Thus, the total number of 
different terms in the expression for (jcijc 2 • • -x 2n ) is (2n)!/«!2". 


4.E. BINOMIAL DISTRIBUTIONS 

A commonly found application of probability theory is for the case of a large 
number, N, of independent experiments, each having two possible outcomes. 
The probability distribution for one of the outcomes is called the binomial 
distribution. In the limit of large N, the binomial distribution can be 
approximated by either the Gaussian or the Poisson distribution, depending 
on the size of the probability of a given outcome during a single experiment. We 
shall consider all three distributions in this section. We shall also use the 
binomial distribution to find the probability density for a random walk in one 
dimension. 


4.E.I. The Binomial Distribution [2-5] 

Let us carry out a sequence of N statistically independent trials and assume that 
each trial can have only one of two outcomes, 0 or +1. Let us denote the 
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probability of outcome, 0, by q and the probability of outcome, +1, by p so that 
p + q — 1 . In a given sequence of N trials, the outcome, 0, can occur no times 
and the outcome, +1 times, where N = n 0 + n\. The probability for a given 
permutation of n 0 outcomes, 0, and n\ outcomes, +1, is q m p n 1 since the N trials 
are statistically independent. The probability for any combination of n 0 
outcomes, 0, and n\ outcomes, +1, is 


Ps(nx) = (4.38) 

nQ\n\\ 

since a combination of n2 outcomes, 0, and n\ outcomes, +1, contains 
(iV!/n 0 !ni!) permutations. Equation (4.38) is often called the binomial 
distribution even though it is not a distribution function in the sence of 
Section 4.D. From the binomial theorem, we have the normalization 
condition 

N N v , 

5' w "' , -5 |4J ” 

We can view the sequence of trials in another way. Let X t be the stochastic 
variable describing the outcome of the ith trial and let X, have two realizations: 
x = 0 with probability q and x = +1 with probability p. The probability density 
for the ith trial is P Xl (x) = qb(x) + pS(x — 1), and the characteristic function for 
the ith trial is f Xi (k) = q+ pe ik . 


■ EXERCISE 4.10. The probability of an archer hitting his target is If he 
j shoots five times, what is the probability of hitting the target at least three 
times? 

Answer: Let n\ be the number of hits. Then Af = n 0 + «i =5 ,p = 5, and 
[ <7 = 3- The probability of having n\ hits in N = 5 trials is 
! ^5(«i) = (5!/ ! (5 — ni)!)0”‘ Q) 5 The probability of at least three hits 
|_=^(3)+ P S (A) + P S (5) = w 0.21. 


We will now let Y N = X\ +Xi~\ + X^ be the stochastic variable which 

describes the additive outcome of N trials. Since the trials are independent, the 
probability density for the stochastic variable, Y N , is just 


Py N (y) = 


dx 1 • • • dx N S(y -xi x N )P Xi (xi) x ■ ■ ■ x P Xn (x n ). 


(4.40) 
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From Eq. (4.35), the characteristic function for the random variable, Y N , is 


fy„(k) = [&,■••[ dx N e ik ^ + - +X ^p x , (x,) X ■ ■ ■ x P x „(x N ) 
= A,W X ••• xfxjk) = (g+pe' t ) N . 


(4.41) 


We can expand the characteristic function using the binomial theorem. We 
obtain 


fr N (k) - 


N\ 


^ 0 ni\(N - ni)V 


i qN~~n\ e in\k 


(4.42) 


The probability density, Py N ( x), for the stochastic variable, Y N , is 


PrM= 2i 


dke ,ky fr„{k) = ^ 


N\ 


Z^ ni !(N - m)! 


p nx cf*~ ni 8(y — ni) 


(4.43) 


= y]^v(ni)%-«i), 

«i=0 


where P N (n i) is the probability that the outcome, +1, will occur n\ times in N 
trials. 

The moments, ( y ) = (ni), are easily found by direct summation or by 
differentiating the characteristic function. For example, 

("0 = - E n , !( ”'! ! „ |)! P"'g A '- - p£(p + *)" =pN ■ 

( 4 . 44 ) 

Also, it is easy to see that 

(y) = Ijm(-i) =pN= <«i). ( 4 - 45 ) 

In a similar manner, we obtain for the second moment 

(/> = <«?) = J2 «ifti(»i) = (A'P) 2 + N P1- ( 446) 

«i=0 


The variance is given by (n\) - («i) 2 = N pq, and the standard deviation is 

G N = y/N pq. ( 4 . 47 ) 
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Aofai) 



Fig. 4.2. The binomial distribution for p — \ and N — 10; for this case, {n\) = y . 


The fractional deviation is 


on _ fq 1 

(«i) \pVn' 


(4.48) 


The fractional deviation is a measure of the deviation of the fraction, n\ /N, of 
trials with outcome, +1, from its expected value, p, in any single sequence of N 
trials. A small value of a N /{n\) means that n\/N will likely be close to p. For 
N — > oo, o N /{n\) — * 0 so that n\/N — ► p. The binomial distribution, for the 
case N = 10 and p = \ is plotted in Fig. 4.2. 


4.E.2. The Gaussian (or Normal) Distribution 

Let us make a change of stochastic variables, Zn = (y# — (y )) / oy = 
((*v -p N )/y/pqN). Z N is a stochastic variable whose average value, (z), 
equals 0 when (y) = pN. The probability density for the stochastic variable, Zn, 
is 


Pz N {z) = 



(y-pN) \ 
y/pqN ) 


p y N {y)- 


The characteristic function is 


fz N {k) = 


'OO 

—oo 


dze ikz P Zn (z) 



= (q e~ ik ^ p ! qN +pe~ iky S q / pN ) N , 


(4.49) 


(4.50) 


where fy N (k) is defined in Eq. (4.41) and we make use of the fact that 
q = 1 — p. We now will take the limit N — » oo. We first expand the quantity in 
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brackets in powers of k and obtain 


where 


/*(*)= (l-jtfO+Kw)) , 


„ ^ 1 ( ik\ m - 2 p<r + <l(-p) m 

" kn'XjN) sJW* f ' 


As N — ► oo, we have — * 0 and 


AW = J™ A» w = & (' " !v ' (1 + Rn) ) = e "“ 2)/2 ’ 


where we have used the definition 


lim (l+^)*= e z . 

N — >oo V N J 


(4.51) 

(4.52) 

(4.53) 

(4.54) 


Thus, in the limit N — > oo, we obtain 

i r 00 i 

p z (z) = i - dke-^e-#' 1 = -L e-?! 2 . ( 4 . 55 ) 

27T J —oo V27T 

It is easy to show that the first moment, (z), equals 0 and that the standard 
deviation, oz, equals 1. 

For the case N » 1 but N finite, we still have Pz N (z ) « ( l/\/27r)e ~ z 2//2 . 
Transform back to the stochastic variables, y = cryz + (y). Then 

•OO J 

^v(y) « dz6(y - a Y z - (y))Pz(z) = 7 ==exp 

J— oo OyyJ Z'K 

( 4 . 56 ) 

Equation (4.56) is the Gaussian probability density for the number of outcomes, 
+1, after many trials. It is also the most general form of a Gaussian probability 
density. It is important to note that the Gaussian probability density is entirely 
determi ned in term s of the first and second moments, (y) and (y 2 ) (since 

(jy = y (y 2 ) — (y) 2 )- In Exercise 4.4, we have plotted a Gaussian probability 
density for (y) = 0 and cry = 2. 


( (y~(y)) 2 \ 

( 24 ■ 


4.E.3. The Poisson Distribution 

The Poisson distribution can be obtained from the binomial distribution in the 
limit N — * oo and p — * 0, such that Np = a <C N (a is a finite constant). Let us 
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return to Eq. (4.41) and let p = (a/ N). Then 

/>■»« = (i - 1 +^“) • 

If we now take the limit, N — » oo, we obtain 

f Y {k) = lim fy N (k) = lim (l+^-(e ik - 1)) =exp(a(e lk - 1)) 

N—>oo N— »oo \ ly / 

o° r* 

= e-Y- 

tn 


(4.57) 


, imk 


(4.58) 


m= 0 


Thus the probability density is 






dfo? ‘*7y(fc) = ^2 — y e a 6(y - n x ) 


nj= 0 1 


(4.59) 


The coefficient 


/>(«,) = ^U-, (4.60) 

rip. 

is commonly called the Poisson distribution. It is the probability for finding n\ 
outcomes, +1, after many trials, when the probability, p (for an outcome, +1, in 
a single trial), is very small (p <C 1). The first moment, (ni), equals a. The 
Poisson distribution depends only on the first moment, and therefore it is 
sufficient to know only the first moment in order to find the probability density 
for a Poisson process. In Fig. 4.3, we plot the Poisson distribution for a = 2. 


Pio(ni) 



Pig. 4.3. The Poisson distribution for (n) = a = 2. 
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■ EXERCISE 4.11. A thin sheet of gold foil (one atom thick) is fired upon 
by a beam of neutrons. The neutrons are assumed equally likely to hit any 
part of the foil but only “see” the gold nuclei. Assume that for a beam 
containing many neutrons, the average number of hits is two. (a) What is the 
probability that no hits occur? (b) What is the probability that two hits 
occur? 

Answer: Since the ratio (area nucleus/area atoms) « 10~ 12 , the probability 
of a hit is small. Since the number of trials is large, we can use the Poisson 
distribution. Let n\ denote the number hits. Then (nj) = 2 and the Poisson 
distribution can be written P{n\) = e~ 2 2 ni /nil. 

(a) The probability that no hits occur is P(0) = e~ 2 2°/0l = 0.135. 

(b) The probability that two hits occur is P{ 2) = e~ 2 2 2 /2\ = 0.27. 


4.E.4. Binomial Random Walk 

The problem of random walk is an example of an application of the binomial 
distribution to a problem in physics. Consider a particle which is constrained to 
move along the x axis. It has probability, p = of taking a step of length, A, to 
the right and a probability, q = \, of taking a step of length, A, to the left. Let us 
assume that the particle takes N steps and that the outcome of each step is 
independent of previous steps (the steps are statistically independent). For the 
ith step, let the stochastic variable, Xj, have realization, x = +A, for a step to 
the right and x = —A for a step the left. The probability density for the ith step 
is Px, (x) = \ (<*>(.* + A) + 6(x — A)). The characteristic function for the ith step 
is fx t (k) = cos(fcA). 

The net displacement, Y # (along the x axis), of the particle after N steps is 

given by = X\ + f Xn. The characteristic function for the stochastic 

variable, Y N , is 

t n\ ( n \\\U (i k 2 A 2 \ N Nk 2 A 2 

frM = (cos(*A)) = ( 1 2! — I 7 ~ 1 2!~ + "‘ ^ 4 ' 6 

[cf. Eqs. (4.40) and (4.41)]. The first and second moments are (y) = 0 and 
(y 2 ) = NA 2 , respectively, and the standard deviation is <jy N = A \/N. In Fig. 4.4 
we show three realizations of this random walk for N = 2000 steps and A = 1 • 
There are 2 N possible realizations for a given value of N. 

We can find a differential equation for the characteristic function in the limit 
when the step size, A, and the time between steps, r, become infinitesimally 
small. Let us write the characteristic function, fy N (k) = fy(k,Nr), where t = Nr 
is the time at which the Mh step occurs. Also note that initially ( N = 0) the 
walker is at y — 0. Thus, Py 0 (y) = <5(y) and fy(k, 0) = 1. From Eq. (4.61), we 
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Fig. 4.4. Three of the 2 N possible realizations of the binomial random walk for 
p — q = i step size, A = 1, and number of steps, N = 2000. N(y) is the number of 
times the walker reaches point y. For this case (y) = 0 and a Y « 44.7. In each case the 
walker starts at y = 0. 


can write 


fr(k, (N + 1 )r) —f Y (k, Nr) = (cos(&A) - l)f Y (k,Nr ) 


(4.62) 


where we have expanded cos(&A) in powers of kA. Now take the limits 
N — > oo, r — ► 0, and A — ► 0 so that D = (A 2 /2r) is finite and Nr = t. If we 
note that 


lim lim 

N—>oo t— ► 0 


im fr(k,(N+l)T)-f y (k,NT) dfy(k, t) 
->o r dt 


(4.63) 


/ k A 2 \ 

+ = (4.64) 
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then combining these equations we obtain 


dfyjk, t ) 

dt 


—Dl?fy(k, t). 


For the case fy(k, 0) = 1, Eq. (4.65) has the solution 

f r (k,t)=e- Dkh . 


The probability distribution is 


P(y,>) 


i 

2n 


dke 


-iky e -Dk 2 t 



(4.65) 


(4.66) 


(4.67) 


The probability de nsity for the random walker is a Gaussian whose standard 
deviation is a = \j2Dt. The probability density spreads with time. 

We can derive Eq. (4.67) in a completely different way. If we make the 
change of stochastic variables, Z N = (Y^/Ay/N), then the characteristic 
function, / z „ (k), associated with the stochastic variable, Z N , is 

f z „(k) = (cos (k/SR)) N (4.68) 

[cf. Eq. (4.50)]. Following Section 4.E.2, we can take the limit N — ► oo and find 

fz(k) = lim fz N {k ) = exp (- y") , (4.69) 

N—>oo \ L J 

and therefore P z {z ) = (l/v^7r)exp(— z 2 /2). For very large N, we can write 
p z N (z) ~ Pz(z). The probability density for the net displacement, Yn, then 
becomes 


p y N (y) 


dz6(y - A VNz)Pz(z) = 


1 


V2ttA 2 N 


exp 


y 2 \ 

2A 2 Nj' 


(4.70) 


If the particle takes n steps per unit time, then N = nt and the probability 
density that the particle lies in the interval, y — ► y + dy, along the x axis, at time 
t is 


J OO 

dz8(y - A y/Nz)Pz(z) = 

— OO 


1 


\/4TcDt 


exp 


y_ 

4 Dt 


(4.71 


where D = \ nA 2 is the diffusion coefficient for the random walk. The 
probability density as a function of time is plotted in Fig. 4.5. Initially, P(y , 0 
has the form of a delta function, <5(y). As time passes, it spreads in such a way 
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0.4 

0.3 

0.2 

0.1 


Fig. 4.5. A plot of the probability density, P (y, t) [cf. Eq. (4.71)], for D = f, 
t= 1,10, 100. 

that the total area underneath the curve is equal to one, but the probability of 
finding the particles at so me dis tance alo ng the x axis grows with time since the 
standard deviation, a = V A 2 N = s/'lDt, grows as the square-root of the time. 

4.F. A Central Limit Theorem [6] and Law of Large Numbers 

It can be shown that the probability density describing the distribution of 
outcomes of a large number of events universally approaches a Gaussian form 
(provided the moments of the distribution for the individual events are finite). 
This is called the Central Limit Theorem. The central limit theorem has been 
proved in complete generality in Section S4.C. Below we give a slightly weaker 
version, directly applicable to measurements. This result shows why Gaussian 
distributions are so widely seen in nature. Another result of considerable 
importance is the law of large numbers. The law of large numbers gives 
quantitative justification to the use of probabilities. 

4.F.I. A Central Limit Theorem 

The Central Limit Theorem (cf. Section S4.C) can be stated in a simplified form 
which has direct relevance to the statistics of measurements. Let us consider a 
stochastic variable, Y which is the deviation from the average of N statistically 
independent measurements of a stochastic variable, X. We write Y N as 

Tat = — (Xi H b X # ) — (x) = Z\ + • • ■ + Z/v, 

z t = ± j (X i -(x)) 



where 


(4.72) 
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The characteristic function, f z (k]N), for the stochastic variable, Z t ~ 
( 1 /N) (Xi — (x)), can be written 

( °0 i jl2 

dxe'WW'-MPxM = 1 - 2W + ' ' ' ’ (4 ' 73) 

where cr| = (x 2 ) — (x) 2 . For large N and finite variance, higher-order terms in 
the expansion of the right-hand side can be neglected. (Note that the oscillatory 
character of the integrand ensures that the integral goes to zero for large k.) The 
characteristic function for Y N is then 

frs(k) = (l + ~* exp ( - l^) as ( 4 - 74 ) 

In Eq. (4.74), we have used the identity lim^_ 400 (l + x/N) n = e x . Thus 


PrM ^iL dke,kytxp (- k W) 



(4.75) 


as N —> oo. Regardless of the form of Px(x), if it has finite moments, the 
average of a large number of statistically independent measurements of X will 
be a Gaussian centered at (x), with a standard deviation which is 1 /\/N times 
the standard deviation of the probability density of X. 


4.F.2. The Law of Large Numbers [3, 4] 

The law of large numbers underlies the intuitive concept of probability that we 
introduced in Section 4.C. Much of the content of the law of large numbers is 
contained in the Central Limit Theorem as applied to measurements. We will 
give a simple version of it here. 

The law of large numbers applies to N independent experiments and may be 
stated as follows: If an event. A, has a probability, p, of occurring, then the 
fraction of outcomes. A, approaches p in the limit N — > oc. The proof has two 
steps. The first step involves the derivation of the Tchebycheff inequality. The 
second step uses this inequality do derive the law of large numbers. 

The Tchebycheff inequality establishes a relation between the variance and 
the probability that a stochastic variable can deviate by an arbitrary amount, e (£ 
is positive), from its average value. The variance, o z , of a stochastic variable, Z, 
is written 


4 = 


dz{z - {z)) 2 Pz{z). 


(4.76) 


If we now delete that range of the variable, z, for which ]z — (z)| we can 
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write 


<T 7 ^ 


{Z)~£ 


dz(z - {z)) 2 P z (z) + 


{z)+£ 


dz(z - { z)) 2 P z {z )• 


(4.77) 


Since under the integral | z — (z ) | ^£ 2 , we can replace (z - (z)) 2 by e 1 in Eq. 
(4.77) and write 




r(z)-£ oo \ 

dzP z (z)+ dzP z (z)) = e 2 P(\z- (z) \>e), (4.78) 

J-00 J(z)+£ ) 


where P(|z — (z)\^e) is the probability that the stochastic variable, Z, deviates 
from (z) by more than ±e. From Eq. (4.78), we obtain the Tchebycheff 
inequality 

P(\z-{z)\>e)*£. (4.79) 

Thus, for fixed variance, cr\, the probability that Z can differ from its average 
value by more than ±e decreases as e~ 2 for increasing e. 

We now come to the law of large numbers. Let us consider N independent 
measurements of the stochastic variable, Z. Let Y N be the mean value of the 

outcomes, Y N = (\/N){Z\ H b Zn), where Z, is the outcome of the ith 

measurement. The law of large numbers states that the probability that Y N 
deviates from (z) goes to zero as N oo. Thus, the limw-Kx, 
P(\yN — {z)\^e) = 0. To prove this, let us first note that ( y n) = (z). Since 
we have independent events, the variance, ay N , behaves as cr\ N = cr^/N. We 
now use the Tchebycheff inequality to write 

P(\y N -(z)\>e)<^- = ^ i . (4.80) 

Thus, we find 

lim P(|yyv - (z) \>e) = 0, (4.81) 

/v — ►OO 


provided that oz is finite. 

► SPECIAL TOPICS 

► S4.A. Lattice Random Walk [7] 

Random walks on lattices provide some of the simplest examples of the effect 
of spatial dimension on physical processes. This type of random walk is so 
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• ♦- - - -4 ♦ ♦- - - -• • 

-N -N+1 -1 0 1 N-1 N 

Fig. 4.6. A one-dimensional lattice with period 2 N + 1 . If the walker steps right at site 
l = N, it enters from the left at site / = —N. 


simple that we can use the concepts of the previous sections to compute the 
probability that the walker will reach any other part of the lattice. This section is 
based on a discussion of lattice random walks by E. Montroll [7]. We will find 
that random walks can have quite different properties in three dimensions than 
in one or two dimensions. 


► S4.A.1. One-Dimensional Lattice 

Let us consider a random walker on a periodic one-dimensional lattice with 
2N + 1 lattice sites (cf. Fig. 4.6). Let P s (l) be the probability to find the walker 
at site, /, at discrete time, s. Since the lattice is periodic, P s (l) = P s (l± 
[2 N + 1]). If the walker steps from site N to the right, he enters again from the 
left at site —N. We will assume that at each step the walker steps one step to the 
right with probability p = \ , or he steps to the left with probability q = \. The 
probability density for the ith step is 

p(l i )=i(S,„ + 1 (4.82) 


We will assume that each step is independent of previous steps and that the 
walker starts at site / — 0 at time 5 = 0. Then the position of the walker at time 5 

(after s steps) is l = l\ + h H 1 -l s . From Eq. (4.40), the probability, P s (l), is 

given by 

m=±-± fy/i+/ 2 +- +iXh)p(h) x ••• x p(l s ). (4.83) 

h=-N l s =-N 

Since P s (l ) is a periodic function of l, we can expand it in a Fourier series 


Ps(/)= 2jvTr£/ Wexp ( 


27rinl \ 

2aT+T/’ 


(4.84) 


where f s (n) is the Fourier amplitude and is also the characteristic function tor 
this stochastic process. We can revert this equation and write the Fourier 
amplitude in terms of P s (l), 


fs(n) = exp + 


l=-N 


2rcinl 


2M -f 1 


(4.85) 
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We have used the identity 

J> p { ± wfi) = (2N+l) ^ 

We can also Fourier transform the transition probability, p(l ) (cf. Eq. (4.82)), to 
obtain 

A(„) ^ J>« exp (+ = cos (-^ ) , (4.86) 

where A(n) is the Fourier amplitude. From Eqs. (4.83) and (4.85), we can 
write 

M») = (A(«)) s (4.87) 

for the characteristic function of the stochastic process. 

Let us now take the limit N — > oo. First we introduce the variable 
(f) = (2Trn/(2N +1)). Then for large N , 

1 1 fW i r* 

(2N + 1)J? n = (2/V+l)J_/" ~ 2w J_/ 0 ’ 

and the probability, P 5 (/), in the limit N — > oo, becomes 

/>,(/) = L f (4-88) 

— 7T 

and 

OO 

/.( 0 ) = J2 ( 4 - 89 ) 

/= — 00 

The single-step probability becomes 

p ( ,i)= ^r (4 - 9o) 

From Eq. (4.87) we obtain 

/,(0) = (W- (4.91) 

(Note that if the walker starts at / = 0 at time s = 0, then / o (0) = 1.) 
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It is now easy to compute P s (l) for a random walk on a one-dimensional 
lattice. If the walker starts from site, / = 0, we obtain 


- for s — l even and s^l, 

! 

1, 0 otherwise. 

It is useful to ask another question about the walker. What is the probability 
that the walker will escape from the origin? To formulate this in a simple way, 
we first introduce a generating function, U(z, /), for P s (l): 

00 1 P7 r ~il(t> 

U(Z ,D = -uL‘ (493) 

To obtain the term on the right, we have used Eqs. (4.88) and (4.91) and we 
have summed over s. The probability, P s (l), is then obtained from U(z, l ) by the 
equation 

(494) 





= < 


d<f)(cos(4>)y e ll<t> 


5! 


2/ [s — r\,/s + l 


■ EXERCISE 4.12. Compute the generating function, U(z,0), for the 
probability to be at site / = 0, at time s, given that the walker was at site 
/ = 0 at time 5 = 0. 

Answer: Since /o(<£) = l,(/(z,0) is given by U(z,0) = (l/27r) ^ n d(p(\/ 
[1 — zcos(</>)]). We can make the change of variables, x = tan(</>/2). Then 
cos(0) = ((1 — jc 2 )/(1 -h jc 2 )) and d<t> = 2dx/(\ -t-x 2 ) and U(z, 0) = 
( 2 /tt) J 0 °° (dx/[(l - z) + X*(l + z) ]) = (1 -z 2 y x/1 . 


We will now let Q s (l ) denote the probability that the walker arrives at site / 
for the first time after s steps. We can introduce a generating function V(z,l) f° r 

OO 

v(z,i) = £V&(0- ( 4 - 95 > 

s= 1 

Thus, V(1 ,/) is the total probability that the walker reaches site l during the 
random walk. 
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We can now relate the generating functions V(z,l) and U(z,l ). The walker 
can arrive at a lattice site / for the first time after s steps or for the first time after 
one step and then again after s — 1 steps in any manner, and so on. Thus, if the 
walker starts at / = 0, 

Po(l) = 6 lfit (4.96) 

and we can write 

Ps(f) = Qj( l ) p s-j(°) for S > °- ( 4 - 97 ) 

j=i 

If we combine Eqs. (4.93), (4.96), and (4.97), we obtain 

00 00 s 

U(z, l) = ho + £ fP,(‘) = ho + £ £ Qi(l)P s -i(0)zf. (4.98) 

S=1 i=l j=l 

We can change the order of the summation in Eq. (4.98) (2~i l = 
an( i perform the summation. Then we find that U(z,l) = 
Si t o + V(z!l)U{z,0) and 


V(zJ) 


U(z,l)-6 l]0 
U(z, 0) 


The total probability to return to the origin is given by 


V(1,0) = 1 - 


1 

(7(1,0)’ 


(4.99) 


(4.100) 


and the total probability that the walker escapes from the origin is 


1 

Pescape ~ (7(1,0) ' 


(4.101) 


For the one-dimensional walker, (7(1,0) = oo and P escape = 0 (cf. Exercise 
4.12). Thus, for random walk on a one-dimensional lattice, the walker will 
return to the origin with probability equal to one if it starts at the origin. The 
walker cannot escape. 

It is very interesting to generalize the above discussion to random walks in 
two or more dimensions, as we shall see below. 


► S4.A.2. Random Walk in Higher Dimension 

To generalize the above results to ^/-dimensional lattices, we introduce unit 
vectors, £*(* = 1,2,..., d) for the various orthogonal directions in space. If the 
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lattice sites are a unit distance apart, we can denote their position by the vector, 

1 = /jx, 4 b ld*d, where /, are integers. The probability to find the walker at 

site, 1 at time s is P, (I) = P s (li , . . . , Id). We will let the lattice have 2 N + 1 sites 
in each spatial direction so that —N ^ /, (i = 1 We will assume that 

the lattice is periodic in each direction. That is, 

Ps(h,h, Ps(l i ± [2N + 1], / 2 , . . . , Id) 

= P s (luh±[2N + l},... ,!„) = ■■■ (4.102) 

= P,(l \,h,...J d ±[2N+l\). 

The probability, /^(l), can be expanded in the Fourier series: 

V / n\ = -N n d =-N v ' 

where n = n\\\ -I b n d x d , and the Fourier amplitude, /^(n) =f s (ni , . . . , n d ), 

is given by 


/»(“) = Y, ••• S (4 ' 104) 

h=-N l d ~—N + ' 

Let us now take the limit N —* oo. We introduce the variable, 
<f>i = 2imi/(2N +1) (i = 1, . . . ,d), and <p = 0jxH b 4>d*d • Then we find 


and 


/M') = 



dtf>dfA.<P) e "'*1 

J —7 r 


(4.105) 


OO OO 

/.W= £ £ F,(l)e + "r (4.106) 

/]= — 00 l d =-o o 


It is of interest to consider a random walk on two- and three-dimensional 
simple square lattices, where the walker steps to one of its nearest neighbors at 
each step. Let us first consider the case, d = 2. Such a lattice is shown in 
Fig. 4.7. If the lattice spacing is a = 1 and the walker is equally likely to step in 
any direction, the single step probability can be written 

pi}) = 7 ^2 (^1^2,0 + ^ ( 4 . 107 ) 

£=±1 

where 1 = l\X\ + hx 2 . The characteristic function for the single-step 
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Fig. 4.7. (a) A simple square two-dimensional lattice, (b) Random walk for 1000 steps 
on a square two-dimensional lattice. The walker starts at 1 = 0. 


probability is 


OO OO i 

A(q>)= £ £ pm^'^-tcos^O + cos^)), (4.108) 

l\—-ooh=-oo ~ 


where 


<p = + (f) 2 \ 2 ■ 


If the walker starts at 1 = 0, then/ 0 (<p) = 1 and the generating function, U[z , 1), 
is given by [cf. Eq. (4.93)] 




d(f> i 


1-/^1 


, — il-(p 


-f (cos(^i) + COS(0 2 )) 


(4.109) 


Let us compute U(z, 0). We make the change of variables, jc, = cos (</>,). Then 


17(1,0) = 



dx\dx 2 


- 1 y/1 — x\ y/l -x\[\ -|(JC1 + * 2 )) 
We first integrate over x 2 and obtain 

i/(i, «) -if ** 


(4.110) 


-g)‘ 


(4.111) 



206 


ELEMENTARY PROBABILITY THEORY AND LIMIT THEOREMS 



Fig. 4.8. A simple cubic lattice with lattice 
spacing, a. 


(cf. Ref. 8). We next integrate over x\ and find 

U{l,0)=-K(z), 

TX 


(4.112) 


where K(k) is the complete elliptic integral of the first kind (k is the modulus). 
When k — » 1 ,K(k) — * oo, and when z — ► 1, £7(1, 0) — > oo. Thus, the escape 
probability for a random walker on this two-dimensional square lattice is zero, 
P escape = 0 [cf. Eqs. (4.100) and (4.101)]. In Fig. 4.7, we show one realization 
of a random walk on the d = 2 square lattice. The walker starts from 1 = 0 and 
1000 steps are shown. 

Let us now consider a random walk on a simple cubic lattice with lattice 
spacing, a = 1 (cf. Fig. 4.8). In this case, the single-step probability is 


p(l) = ^ (<V<-£/ 2 ,o<5/3,o + £/,,o<5/ 2 ,£<5/3,o) + <5/,,o<5/ 2 ,o<5/ 3 , £ ) (4.1 13) 

£—±l 

and the characteristic function is 

A(cp) = ±(cos(0i) + cos(0 2 ) + cos(0 3 )). (4.114) 

If the walker starts at 1 = 0 , the generating function, U(z, 0 ), is 


UM= [h 


d<f>i 


d(j> 2 


d(f>3 


1 


1 - f (cos(0i) + COS (02) + COs(0 3 )) 


(4.115) 


Let us now compute the escape probability, P escape = l/t/(l,0). We may write 

3 m rn r n d&ldfadfo 


0) = (i) pr 

W Jo Jo Jo 1 — 3 


(COS (01 ) + COS (02) + cos(0 3 )) 


(4.116) 
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With a change of variables, jc, = cos (<&), £7(1, 0) takes the form 


1/(1, 0) 



dx}dx 2 dxj 

yi -^\/i -40 - j(*i +Jt2 +* 3 )) 


(4.117) 


We first integrate over x 3 and obtain 


£/(!,#) = 



dx\dx2 

V 1 ~ x W i - x 2 \jiy - 5(^1 + x i)f -5 


(4.118) 


(cf. Ref. 8). We next integrate over X 2 and find 


t/(l,0) 


6T 1 dx 1 ( 2 \ 

7T 2 J-l (3 - Xi)y/l ~X\ V 3 - X J 


1.516 


(4.119) 


(cf. Ref. 7), where K(k) is the complete elliptic integral of the first kind (k is the 
modulus). From Eq. (4.119) we have computed £7(1,0) numerically. An 
analytic expression is obtained in Ref. 9. The escape probability for a random 
walker on a simple cubic lattice is P escape — 1 / £7( 1 , 0) « 0.66. Thus, for this 
three dimensional simple cubic lattice there is a large probability that a random 
walker, who starts at the origin, will never return to the origin. The walker can 
escape. 

The results of this section are special cases of a more general result due to 
Polya [10] where he shows that random walkers (whose step size distribution 
has finite variance) on one- and two-dimensional lattices can never escape their 
origin. They eventually will return to their starting point. But in three 
dimensions, there is a finite probability that a walker can escape forever from its 
starting point. Random walks for which the escape probability is zero are said 
to persistent. Random walks with finite escape probability are said to be 
transient. 


► S4.B. Infinitely Divisible Distributions [6, 11] 

The concept of an infinitely divisible distribution function is important to all 
limit theorems in statistical physics. The limiting distribution can only belong 
to this class of distributions. A stochastic variable, Y, is infinitely divisible if for 
any integer N, it can be represented by a sum 


Y = Xi+X 2 + ---+ Xn 


(4.120) 


of identically distributed stochastic variables, Xi(i — 1 ,... ,N). The distribution 
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functions, F Y (y), for infinitely divisible stochastic variables, Y, are called 
infinitely divisible distribution functions. The distribution function, Fy(y), is 
infinitely divisible if and only if, for any N, its characteristic function f Y (k) is 
the Mh power of some characteristic function, f x (k,l/N). Thus, fy(k) = 
(f x (k,l/N)f or 

A(*> l/AT) = (Mk))' ,N (4.121) 

(we take the principal branch of the Mh root). Infinitely divisible distributions 
may have either finite or infinite variance. We give some examples of infinitely 
divisible distributions below. 


■ EXERCISE 4.13. Show that the characteristic function for an infinitely 
divisible distribution has no real zeros. 

Answer: Let g(&) = \im.N^ 00 (f(k)) l ^ N . Then either g(k ) =0 or g(k) = 1 
since g(k) = 0 when/(&) = 0 and g(k) = 1 when/(fc) 7 ^ 0. Since /(0) = 1 
and f(k ) is continuous, f(k) ^0 in a neighborhood of the origin and 
g(k) = 1 in a neighborhood of the origin. But since g(k) is also a 
characteristic function, it is continuous and cannot jump from value g(k) = 1 
to g(k) = 0 at some value of k. Therefore, g(k) = 1 for all k and f{k) f 0 for 
all k. The converse, however, need not be true. If a characteristic function has 
no real zeros, it still might not be infinitely divisible. 


^ S4.B.1. Gaussian Distribution 

The Gaussian distribution is infinitely divisible. To see this, let us consider the 
most general form of the Gaussian probability density for the stochastic 
variable, Y. The probability density is 


Pr(y) 



(4.122) 


with average (y) = a and standard deviation cry. The characteristic function is 

f Y (k) = exp (^iak - ^ k 2 ^j . (4. 1 23) 

The Mh root is f x {k\ l/N) = ( f Y (k)) 1/N = exp (i{a/N)/k - ( o\/2N)k 2 ). Thus, 
the probability density for each of the identically distributed stochastic 
variables, X t is 

, , I 1 ( (x — a/N) 2 \ 

Px(X ' 1/N) = Y27r(4/W) eXP 2 (4/M) J ’ 


(4.124) 
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and the average value and standard deviation of the stochastic variable X, are 
(x) = (a/AO an d o x = ( oy/y/N ), respectively. 

^ S4.B.2. Poisson Distribution 

The Poisson distribution is infinitely divisible. The Poisson probability density 
for a stochastic variable, Y, is given by 

00 \m p -X 

Priy) = -S(y-a-mh) ( 4 . 125 ) 

m= 0 m ‘ 

where (y) = a 4- \h is the average value of Y and ay = hy/X is its standard 
deviation. The characteristic function for stochastic variable Y is 

f Y {k ) = exp (iak + \{e lkh — 1 )). ( 4 . 126 ) 

The Mh root is f x {k\\/N) = (fy(k)) 1/N = exp (i(a/N)k + \/N{e ,kh - 1 )). 
Thus, the probability density for each of the N identically distributed stochastic 
variables X, is 


OO i / \ \ m 

p x (x; 1/1 V) = £- b) e-V"6(x - % - mh) (4.127) 

and the Poisson distribution is infinitely divisible. The average of X and its 
standard deviation are given by (x) = (a + \h)/N and <j x = {oy/VN), 
respectively. 


)► S4.B.3. Cauchy Distribution 

The Cauchy distribution function, 


My) 


1 

7T 


7T 

j + tan 



(4.128) 


(a > 0 and b are finite constants) for stochastic variable Y, is an example of an 
infinitely divisible distribution which has an infinite variance. The probability 
density for a stochastic variable, Y, is 


My) = 


dFy 

dy 


1 a 

ft a 2 + {y — b) 2 ' 


(4.129) 


The first moment is (y) = b, but the variance and the standard deviation are 
infinite. The characteristic function for stochastic variable Y is 


f Y (k ) = exp ( ikb — |fc|a). 


(4.130) 
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Fig. 4.9. A plot of the Cauchy probability density for b = 0 and a = n. 


The Mh root is fx(k, l/N) = (fy(k)) l ^ N = exp(ik(b/N) — \k\(a/N)). Thus, the 
Cauchy distributions is infinitely divisible. The probability density for each of 
the N identically distributed stochastic variables X, is 


P x {x- 1/AO = - 


a/N 


n(a/Ny + {x-b/N) 


2 • 


(4.131) 


From Eq. (4.131) we find (jc) =b/N. The Cauchy distribution is plotted in 
Fig. 4.9. 


► S4.B.4. Levy Distribution 

The Cauchy distribution is a special case of a more general distribution, called 
the Levy distribution. The Levy distribution has infinite variance and has a 
characteristic function of the form 

f Y {k) = exp(— c|fc| a ), (4.132) 

where 0 < a < 2. We can only obtain an analytic expression for fV(y), for a 
few special cases. We shall discuss the properties of these distributions in more 
detail in Section (S4.D). 


■ EXERCISE 4.14. Consider the characteristic function f(k ) = 
(1 — b)/( 1 - be ik )( 0 < b < 1). Show that it is infinitely divisible. 

Answer: First note that 

OO Iff! 

ln/(«r) = In (1 - b) - In (1 - be‘ k ) = £ — (e ikm - 1). 

m~ 1 



SPECIAL TOPICS: THE CENTRAL LIMIT THEOREM 


211 


Thus 

OO 

/w =n^ / ") <c “- ,> . 

m= 1 

Since f{k) is a product of Poisson characteristic functions, it is infinitely 
divisible. 


^ S4.C. The Central Limit Theorem 


We wish to consider the sum, Y N , of N independent identically distributed 
stochastic variables X,- such that 

y A ,=X 1 + ---+X A ,. (4.133) 


We will let Fy N (y) denote the distribution function, and fy N (k) the characteristic 
function, of stochastic variable Y N . We will let F x (x; l/N) denote the distribu- 
tion function, and f x (k\ l/N) the characteristic function, of stochastic variables 
X- ^ . The characteristic function for the stochastic variable Y N can be written 


( OO 1-00 

< it! ■ dx N e*( x ' + - +X ^p x (x,;l/N) x 

— 00 J— 00 

= (f x (k-l/N))" 


• • x P x (x N ; l/N) 


(4.134) 


[cf. Eq. (4.35)], where P x {x ; l/N) = dF x {x ; 1 /N)/dx is the probability density 
of X,. 

The Central Limit Theorem describes the limiting behavior (Af — » oo) of 
the stochastic variable Y N for the case where the stochastic variables X, have 
finite variance. We will also assume that X, has zero mean, (x) = 0, since this 
causes no loss of generality. More specifically, we consider system for which 

lim f x {k] l/N) = 1 (4.135) 

/V— >OC 

and 

lim Nix 1 ) = C, (4.136) 


where C is a finite constant. For such systems, the limiting distribution function, 
Fy(y) = lim^oo Fy N (y) is infinitely divisible even though Y N and X, might not 
infinitely divisible. Furthermore, the limiting distribution is a Gaussian. This is 
the content of the Central Limit Theorem. 
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Before we take the limit, N — » oo, and prove the Central Limit Theorem, it 
useful in the next section to obtain some inequalities. 

► S4.C.1. Useful Inequalities 

Let us first define 


A fx,N =fx(k ; 1/AO - 1 = 


(e‘ b - 1 )dF x (x; l/N) 


= J(e ,b: — 1 — ikx)dFx{x , l/N). 


(4.137) 


where we have used the fact that (jc) = 0 in the last inequality. Next note that 

\ e lkx — \ —ikxl^kx 1 (4.138) 

(we have plotted the left- and right-hand sides of Eq. (4.138) in Fig. 4.10. If we 
combine Eqs. (4.137) and (4.138), we obtain 

\Af X)N \ = \fx(k; l/N) - 1 (4.139) 


This is our first inequality. 

Let us now obtain a second inequality. For sufficiently large N, we can 
assume that A f x ,N < \ [cf. Eq. (4.135)]. Then 


IM/x(fc; l/N)] — A/x,n\ — |ln[l -F A/x,v] — Af x ^\ 



< 




OO 


iA/x,«r 


1 |AA,m| 2 

2 1 — \AfxM 


^ IA/x,v| 2 - 


(4.140) 



Fig. 4.10. Plot of f(x) = \e ikx — 1 — ikx | 
(solid line) and of f(x ) = f /t 2 (dashed 
line) for k = 1. 


-10 


-5 
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We can now combine the above inequalities to obtain 

llnt/iv) - NAfw\ = AT|ln(A) - Af XiN \ N\Af x , N \ 2 a{Nk(J)\Af x , N \, 

(4.141) 

where A fx^ is defined in Eq. (4.137). We will use these inequalities below. 

S4.C.2. Convergence to a Gaussian 

If we combine Eqs. (4.136) and (4.141), we find 

lim |ln fy N (k) — N ( e lkx — 1 — ikx)dFx(x\ l/N)\ = 0, (4.142) 

N — ►oo 

since lim fx{k, l/N) —> 1. Therefore, 

N — >oo 

f Y {k) = lim fy N (k ) = lim exp (n \{e lkx - 1 - ikx)dF x ) , (4.143) 

N — >oo N — >oc \ J / 

where fy(k) is the limiting characteristic function. 

It is useful to introduce a function 

K n (u) = N JdFxik; l/N), (4.144) 

-00 

Then K N (— oo) = 0 and Kn(+oc) = N(x 2 ). Furthermore, Kn{u) is a non- 
decreassing bounded function. Using Eq. (4.144), we can write 

TV j (e ikx - 1 - ikx)dF x {x-\/N) = {e ikx - 1 - ikx)^dK N {x). (4.145) 

Thus, the limiting characteristic function, f Y { k), takes the form 

f Y (k ) = lim exp [ \{e lkx — 1 — ikx)\dKN(x ) . (4.146) 

N->oo [J x L 

Equation (4.146) is the Kolmogorov formula (cf. Section S4.E). In Section S4.E 
we give a general proof that since K(u ) is a bounded monotonically increasing 
function, fy(k) is infinitely divisible. Below we show this for the special case of 
variables, X,, with finite variance. 

Let us introduce a new stochastic variable, Z„ which is independent of N, 
such that 


1 y/Naz 


(4.147) 
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Here oz is the standard deviation of Z, and is finite. We assume that the 
stochastic variables Z, are identically distributed. We see that (jc) = 0 and 
(x 2 ) = ((z 2 ) - (z) 2 )/A'ctj. Thus, 

=1 (4.148) 

°z 

and Eq. (4.136) is satisfied. Furthermore, we have 

Jim /*(*; 1/AO = Bm (^) = 1 (4- 149) 


(see Eq. 4.50) and Eq. (4.135) is satisfied. 

Let us look at the limiting behavior of K N {u) in Eq. (4.144). First note that 


so that 


Kn(u) = N 


dxx 1 PxU: 1/AO 
'® nu . (z-(z » 2 


<fe- 


Pz(z) 


lim Kn(u) = 

/V— > 00 


“ ) = {o 


for m > 0 
for u < 0 


(4.150) 


(4.151) 


It is easy to show that this is just the condition that the limiting distribution be a 
Gaussian. We can write Eq. (4.151) in the form lim/v-.oo Kn(u) = 0(m), where 
@(w) is the Heaviside function. Therefore, lim^oo dK N (u ) = 6{u)du. If we 
substitute this into Eq. (4.146), we obtain 

f Y (k) = e~ e l 2 . (4.152) 

From Eq. (4.55), this corresponds to a Gaussian probability density, 
Py(y) = e y ! 2 , centered at (y) = 0 with standard deviation equal to one. This 
result is the Central Limit Theorem. 


► S4.D. Weierstrass Random Walk [12, 13J 

One of the simplest random walks which does not satisfy the conditions of 
the Central Limit Theorem is the Weierstrass random walk. In the continuum 
limit, the Weierstrass random walk is governed by a Levy distribution and not 
a Gaussian. We will first consider the discrete case and then the continuum 
limit. 
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^ S4.D.1. Discrete One-Dimensional Random Walk 

Let us consider a walker constrained to move along the x axis. At discrete time 
intervals, r, the walker takes a step of length b n A to the left or to the right with 
probability (a — l)/2 a n+l ( n can be any integer, n = 0, 1, . . .). We assume that 
each step is independent of all the others. The displacement of the walker after 

N steps is Yn = X\ H b Xn, where X, is the displacement at the ith step. The 

probability density for the displacement X, is 

1 OO i 

p x, w = -^ £*[«(* - *" A ) + f (* + ( 4153 ) 

n = 0 

where a^l and 1. It is easy to show that (jc) = 0 and (x 2 ) = 
(a — l)A 2 /(a — b 2 ). Thus for b 2 < a we have (jc 2 ) < oo, and we expect the 
limiting distribution to be a Gaussian. For b 2 = a we have (jc 2 ) = oo, and for 
b 2 > a , (jc 2 ) is undefined. Therefore, for b 2 ^a the conditions of the Central 
Limit Theorem are not satisfied, and as N — ► oo the probability density for Y N 
need not approach a Gaussian. 

The characteristic function for the ith step is 

( 00 / _ _ 1 \ oo 1 

dxe ik >P Xt (x) = — £-cos(to"A). (4.154) 
-00 \ a / „= 0 a 

which is known as the Weierstrass function. We will consider /(£) for the case 
a = 4 and various values of &. For a = 4 and b=l,(x) = A 2 and 
f(Jc) = cos (kA). Thus, for this case the random walk reduces to a binomial 
random walk. In Fig. 4.11, we plot f(k) for the cases (a = 4,b = 3) and 
(a = 4,b = 5). The characteristic function, f{k), has the very interesting 
property that for b > a it has finite derivatives at no value of k [14]. In fact, 
except for special cases, the characteristic function has structure to as small a 
scale as we wish to look. 



Fig. 4.11. The Weierstrass characteristic function, f(k), for (a) a = 4, b = 3 
(/jl = In (a)/ In ( b ) = 1.26) (b) a = 4, b = 5 (/i = In (a)/ In (b) = 0.86). 
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Fig. 4.12. One realization of the Weier- 
strass random walk for N = 2000, A = 1 , 
a = 4, and (a) b — 5, (b) b = 3, and (c) 
b= 1. 


The characteristic function for the displacement Yn = X\ -\ b is 

/!-.(*) = (/(*))", ( 4 - 155 ) 

where / (k) is defined in Eq. (4.154). In Fig. 4.12 we show one realization of the 
Weierstrass random walk for N = 2000 and for cases (a = 4,b—l), 
(a = 4, b = 3), and (a = 4, b = 5). For the case (a = 4, b = 5), the random 
walk shows clustering with a spread that is growing dramatically with N. For 
the case (a = 4, b = 3) the random walk no longer clusters but still is spreading 
rapidly with N. For the case (a = 4,b = 1) the random walk reduces to a 
binomial random walk which spreads as y/N. 

It has been shown in Ref. 15 that this phenomenon of clustering persists as 
TV — * oo for the parameter range 0 < fi = ln(a) /In (b) < 1 . In the limit N — » oo, 
there will be clusters on all scales down to size A and the probability will have 
a fractal-like structure in space. In the parameter range 0 < /i < 1, it has been 
shown that the walker has a finite probability of escaping the origin. The 
random walk is transient. Therefore the walker does not return to smooth out 
the clusters. For 1 < /i < 2, the random walk is persistent and the clusters get 
smoothed out. 
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Fig. 4.13. Self-similarity embedded in the Weierstrass function. We plot 
f' n (k) = ( l/a m+ \)f(b n+1 k ) for a = 4, b = 5, and (a) n = 0 and (b) n = 1. In going 
from (a) to (b) we are enlarging the central peak. 


The characteristic function, f(k), has embedded in it a self-similar structure. 
To see this let us rewrite f(k ) in the form 

flk) = -f(bk) + !-cos(A£) (4.156) 

a a 


or, more generally, 

?/(«) = W + 'k)+ a -^-^cos(AVk). (4.157) 

We show this behavior in Fig. 4.13 where we plot (1 /a)f{bk) and (1 /a 2 )f(b 2 k) 
for (a — 4, b = 5). As we move to a smaller scale (increasing n ) we focus on the 
central peak of the previous scale. The figure is self-similar to as small a scale 
(in k ) as we wish to examine. 

It is important to note that/(/:) is not infinitely divisible because f{k ) can go 
to zero for some values of k (cf Exercise 4.13 and Fig. 4.11). 

► S4.D.2. Continuum Limit of One-Dimensional 
Discrete Random Walk 

Let us now take the continuum limit of the Weierstrass random walk. We 
proceed as in Section 4.E.4. Let us assume that the time between steps is r, and 
rewrite the characteristic function, fy N (k) =f Y (k,NT). We assume that initially 
the walker is at y — 0 so that Py 0 (y) = ${y) and the characteristic function, 
fy(k, 0), equals 1. We can write 

frik , (N+ 1)t) —fy(k, Nt) = ( f(k ) - l)f Y (k, N t), (4.158) 

where f(k ) is defined in Eqs. (4.154). We will now let a= 1 + a A and 
b ~ 1 + /3A and take the limits N — > oo, A — > 0, and r — ► 0 so that 
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fi = \n(a)/\n(b) — > a/p,Nr — > f, and A^/r — > <5. It can be shown that [13] 
Jim -(/(£) - 1) = /i6|*|^cos(^)r(-/*), (4.159) 

r— >0 A— >0 T \ Z / 

where r(— /z) is the Gamma function. 

For 0<^< l,r(-/j) is negative and cos(^7r/2) is positive. For 
1 < fi < 2,T (—fi) is positive and cos(/i7r/2) is negative. From Eq. (4.159), 
we can write 


(4.160) 

where 


D L = Mcos(^)r(^). (4.161) 

The solution to Eq. (4.160), for the case when/(fc,0) = 1, is 

/(M)= exp(-D L |*rO- (4-162) 

This is the Levy distribution. The probability density, P(y, t), for the 
displacement of the random walker in the continuum limit is given by 

1 f 00 

P(y,t) = ^-\ dke~ ,k!, e- DM ‘. (4.163) 

^ J —00 

Bochner [16] has shown that P(y,t) is only a nonnegative function for 
0 < fi ^ 2 and therefore can only be interpreted as a probability for this range of 
fi. It is easy to check that P(y, t ) is normalized to one. 

There are only a few values of /i for which Eq. (4.163) can be integrated and 
obtained in closed form. The case /i = 1 gives the Cauchy distribution [cf. Eq. 
(4.163)], while the case fi — 2 gives the Gaussian distribution. In Fig. 4.14 we 
show plots of P(y , t ) for t — \/Di and for [i = 0.86 and fi = 1.26, the cases 
considered in Fig. 4.11. P(y, t ) has a very long tail indicating that there is no 
cutoff in the structure for small k (long wavelength). 


^ S4.D.3. Two-Dimensional Discrete Random Walk (Levy Flight) 

The Levy flight is a special case of the more general Rayleigh-Pearson random 
walk [17, 18] in two and three dimensions. We will first set up the Rayleigh- 
Pearson random walk and then consider the case of a discrete Levy flight. 

Let us consider a random walk in the (jc, y) plane such that at the ith step the 
particle takes a step of length a t along a path that makes an angle 0, with respect 
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Fig. 4.14. Plots of P(y, t ) for n = 0.86 (solid line) and /i = 1.26 (dashed line) for time 
t=l/D L . 



Fig. 4.15. The coordinates for a two-dimensional random walk. 


to the x-axis (cf. Fig. 4.15). We will assume that r, and 0; are independent 
stochastic variables, that a, has a probability density / > (r I ), and that 0, is 
uniformly distributed over the interval 0 — > 27r so that the probability density 
P(9i) — (1/27 r). If the walker is at (x = 0,y = 0) at the initial time, then after 
N steps it will be at (x = x N ,y = y N ), where x N = ricos(0i)+ 
r 2 Cos (0 2 ) 4 1- r N cos{0 N ) and y N = nsin(0i) + r 2 sin(0 2 ) 4 h r^sin(0^). 

If we assume that the ith step is independent of the (i + l)th step for all i, 
then the probability density to find the particle at x — > x 4- dx and y — > y 4- dy 
after N steps is 

r2n roo poo 

— ) d9i • • ■ dd N \ dr\ • • • dr N P(n) x • • • x P(r N ) 

Jo Jo Jo Jo 

x £(x - r\ cos (0i ) — r 2 cos (0 2 ) — r N cos (0^)) 

x S(y — n sin (0i) - r 2 sin (0 2 ) r N sin {6n))- 


(4.164) 
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Fig. 4.16. One realization of a Rayleigh-Pearson random walk for the case when all 
steps are of unit length (a = 1). The total number of steps is N — 2000. The walker 
starts at (x = 0,y = 0). 


In Fig. 4.16 we show a two-dimensional Rayleigh-Pearson random walk for 
the case when the step lengths r z (i = 1 , . . . , JV) are identically distributed with 
probability density P{ri) — P{r) = 6{r — 1). For this case, the walker always 
takes steps of unit length, although the direction of the steps is completely 
random. 

In Fig. 4.17 we show a Rayleigh-Pearson random walk for the case when the 
step lengths r are variable and probability density P(r) is given by a Weierstrass 
probability density: 


1 OO I 

P(r)= — Y J -„W-b"m- (4.165) 

n=0 M 

Note that the step lengths are all positive. With this distribution of step 
lengths, there is a small chance that at a given step the walker will take a 
very long step. Therefore the behavior of this random walk is completely 
different from that of a binomial random or simple lattice random walk. 
Figure 4.17 is an example of a discrete Levy flight. For 0 < fi < 2 and in 
both two and three spatial dimensions the random walk is transient [151- 
Therefore, as N — > oo the random walk will form fractal-like clusters on 
all scales down to scale size A. In the continuum limit, it will form clusters 
on all scales. 
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Fig. 4.17. One realization of a Rayleigh-Pearson random walk for the case when the 
distribution of step lengths is given by the Weierstrass probability density. The total 
number of steps is N = 2000. The walker starts at (x = 0,y = 0). The sequence of 
angles is the same as in Fig. 4.16 (a) The complete walk, (b) Magnification of the upper 
section of (a), (c) Magnification of hatched box in (b). 


It is also possible to obtain a Levy-type random walk by haviing the walker 
move on a lattice which has self-similar structure. Such types of random walks 
are discussed in Ref. [19]. 


► S4.E. General Form of Infinitely Divisible 
Distributions 

The definition of an infinitely divisible distribution function is given in Section 
S4.B. For the case of infinitely divisible distributions with finite variance, A. N. 
Kolmogorov [6, 20] has given a general form of the characteristic function, 
called Kolmogorov’s formula, which is also unique. R Levy [21, 22] and A. 
Khintchine [23] have generalized Kolmogorov’s result to the case of infinitely 
divisible distributions with infinite variance. Their formula is called the Levy- 
Khintchine formula. In Sections S4.E.1-S4.E.2 below, we describe these two 
formulas. 
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► S4.E.1. Levy-Khintchine Formula [22, 23] 

The most general form of an infinitely divisible characteristic function has the 
form 


f{k) = exp 


ika + 


a 


,ikx 


- 1 - 


ikx \ 1 + x 2 


1 + jc^ 


dG(x) 


(4.166) 


where a is a constant and G(x) is a real, bounded, nondecreasing function of jc 
such that G(— oo) = 0. The integrand at x = 0 is 



- 1 - 



k l 
2 ‘ 


(4.167) 


Both a and G(x) are uniquely determined by f(k). Equation (4.166) is called the 
Levy-Khintchine formula. 

It is fairly easy to see that the Levy-Khintchine formula is infinitely 
divisible. We can write Eq. (4.166) in the form 


f(k) = 

where o 2 — G{ 0 + ) — G(0~) and 

ikx 


I(k ) = lim 


e — *0 


\x\>e 


e ikx - 1 - 


1 +x : 


1 

“~2 dG(x) 


(4.168) 


Ss % ( e!b ‘ ~ 1 " t+^) ^ {G(x,) ~ 


(4.169) 


Thus f{k) is the product of a Gaussian characteristic function and the limit of a 
product of Poisson characteristic functions, all of which are infinitely divisible. 
Therefore, f(k) is infinitely divisible. 

The quantities a and G(x) can be obtained if we know Fn(x), the distribution 
function associated with (f(k)) l ^ N . Let us write 


In /(*) = lim N(f N (k) - 1) 


N — »oo 


= lim (ika^ 4- ( e h 

N —> >oo V 


ikx \ 1 +X 2 (4.170) 


But N(f N (k) - 1) = N (e ikx - 1 )dF N (x), so that 


f 00 jc Nx 2 

a N = N J t + x2 dF N (x) and dG N (x) = ^ dF N (x) 


(4.171) 
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■ EXERCISE 4.15. Show that the characteristic function, f(k) = 
[(1 - b)/(l + a)][(l + ae lk )/{\ - be~ ik )\(0 < a^b < 1) is not infinitely 
divisible. 

Answer: Note that 

ln(/(&)) = ln(l — b) + ln(l + ae lk ) — ln(l + a) — ln(l — be~ lk ) 

= - *) + - 1)| 
n=l L -l 

( °° I _l. , 2 f 00 1 

( e~ lku — 1) — ^ — dG(u ) — ik -dG{u). 

—00 ^ J— 00 ^ 

We can satisfy the above equation if we let 

00 r y.un nn n 

G («) = Y + n ) - 0 ( M ~ n ) 


^/M + (-l)V 
Y\ 1 + n 2 


But G(u ) is not a monotonically increasing function of u. Therefore, f(k ) is 
not infinitely divisible. 


It is easy to see that for the Gaussian characteristic function, Eq. (4.123), 
a — a and G(x) = <7y@(*). For the Poisson characteristic function, Eq. (4.126), 
ex = a + Xh/(\ + h 2 ) and G(x) = Xh 2 /(l + h 2 )Q(x — h ). For the Cauchy 
characteristic function, Eq. (4.130), a = b and G(x) = [a/ 7r) tan -1 (*) + (a/ 2). 

► S4.E.2. Kolmogorov Formula [6, 20] 

Kolmogorov was the first to find a general formula for infinitely divisible 
distributions, although his formula only applies to distributions with finite 
variance. He obtained 

f 00 1 

f Y (k) = exp ijk + {e lku - 1 - iku}^dK(u) , (4.172) 

J-00 ti 

where 7 is a constant and the function K(u) [whose differential, dK(u), appears 
in Eq. (4.172)] is a nondecreasing function with bounded variation. Note that 
iimjfc-o (df/dk) = 17 and lim*_+ 0 (d 2 //^ 2 ) = ~7 2 - K(oc) . Thus the first mo- 
ment is (y) = 7 and the standard deviation a = y/K(oo). Equation (4.172) is 
called Kolmogorov’s formula and it is unique. 
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The Kolmogorov formula can be obtained from the Levy-Khintchine 
formula if we let K(x) = /^(l +y 2 )dG(y) and 7 = a + J_^yrfG(y). If we 
compare Eq. (4.172) to the characteristic function for a Poisson distribution [cf. 
Eq. (4.126)], the function K(u ) is proportional to the distribution function of 
spacings between realizations of the stochastic variable, Y (spacings between 
discrete jumps in F Y (x)). For example, for the Poisson distribution, 
K(u) = \h 2 Q(u - h ) and dK(u) = A h 2 6(u — h ). For the Gaussian distribution, 
Eq. (4.123), the realizations of Y form a continuum (zero spacing between 
realizations) and K{u ) = <jyQ(u) (we plot K(u) for the Gaussian and Poisson 
distributions in Fig. 4.18). The Kolmogorov formula does not apply to the 
Cauchy distribution, since the Cauchy distribution has infinite variance. 


■ EXERCISE 4.16. Consider the characteristic function, f{k) = 
(1 —b)/{ 1 —b lk )( 0 < b < 1) (cf. Exercise 4.15). Find the Kolmogorov 
function, K(u). 


Answer: First note that 

00 h m 00 00 1 m 00 lot 

In f(k) = Y, ~ ( eikm ~ 1 ) = ik bm + ~ + H ~ ( e ‘ km ~ 1 “ ikm) 

1 171 i 4 17 % . 17 % 

m—l m= 1 m= 1 


0-b) 


w um 

+ — 1 -ikm). 


By comparison with Eq. (4.172), we find dK(u ) = Y2m=i f nb m 8(u — m) and 
K{u) = mb m Q{u - m). Thus, 7 = b/{\ - b) and a 2 = rnb m = 
b/( 1 — b) 1 . Furthermore, K(u) is a nondecreasing function of u and has 
bounded variation. Thus, conditions of Kolmogorov’s formula are satisfied 
and f(k) is an infinitely divisible characteristic function. 
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PROBLEMS 

Problem 4.1. A bus has nine seats facing forward and eight seats facing backward. In 
how many ways can seven passengers be seated if two refuse to ride facing forward and 
three refuse to ride facing backward? 
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Problem 4.2. Find the number of ways in which eight persons can be assigned to two 
rooms (A and B) if each room must have at least three persons in it. 

Problem 4.3. Find the number of permutations of the letters in the word 
MONOTONOUS. In how many ways are four O’s together? In how many ways are 
(only) 3 O’s together? 

Problem 4.4. In how many ways can five red balls, four blue balls, and four white balls 
be placed in a row so that the balls at the ends of the row are the same color? 

Problem 4.5. Three coins are tossed, (a) Find the probability of getting no heads, (b) 
Find the probability of getting at least one head, (c) Show that the event “heads on the 
first coin” and the event “tails on the last two coins” are independent, (d) Show that the 
event “only two coins heads” and the event “three coins heads” are dependent and 
mutually exclusive. 

Problem 4.6. Various six digit numbers can be formed by permuting the digits 666655. 
All arrangements are equally likely. Given that a number is even, what is the probability 
that two fives are together? (Hint: You must find a conditional probability.) 

Problem 4.7. Fifteen boys go hiking. Five get lost, eight get sunburned, and six return 
home without problems, (a) What is the probability that a sunburned boy got lost? (b) 
What is the probability that a lost boy got sunburned? 

Problem 4.8. A stochastic variable X can have values x = 1 and x = 3. A stochastic 
variable Y can have values y = 2 and y — 4. Denote the joint probability density 
P X y(x,y) = X),-=i ,3 IZ;= 2,4 ~ 0% _ j)- Compute the covariance of X and Y for 
the following two cases: (a) pi }2 — P\a = P 3,2 = Pia = \ and (b) pi i2 = Pi a = 0 and 
PiA ~ P3,2 — i- F° r each case, decide if X and Y are independent. 

Problem 4.9. The stochastic variables X and Y are independent and Gaussian 
distributed with first moment (x) = (y) = 0 and standard deviations ax = cr Y = 1 . Find 
the characteristic function for the random variable Z = X 2 + Y 2 , and compute the 
moments ( z ), (z 2 ), and (z 3 ). Find the first three cumulants. 

Problem 4.10. A die is loaded so that even numbers occur three times as often as odd 
numbers, (a) If the die is thrown N = 12 times, what is the probability that odd numbers 
occur three times? If it is thrown N = 120 times, what is the probability that odd 
numbers occur thirty times? Use the binomial distribution, (b) Compute the same 
quantities as in part (a) but use the Gaussian distribution. [Note: For parts (a) and (b) 
compute your answers to four places.] (c) Compare answers for (a) and (b). Plot the 
binomial and Gaussian distributions for the case N = 12. 

Problem 4.11. A book with 700 misprints contains 1400 pages, (a) What is the 
probability that one page contains 0 misprints? (b) What is the probability that one page 
contains 2 misprints? 

Problem 4.12. Three old batteries and a resistor, R, are used to construct a circuit. Each 
battery has a probability p to generate voltage V = vo and has a probability 1 — P to 
generate voltage V = 0. Neglect any internal resistance in the batteries. Find the average 
power, (V 2 ) /R, dissipated in the resistor if (a) the batteries are connected in series and 
(b) the batteries are connected in parallel. In cases (a) and (b), what would be the 
average power dissipated if all batteries were certain to generate voltage V = vo? (c) 
How would you realize the conditions and results of this problem in a laboratory? 
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Problem 4.13. Consider a random walk in one dimension. In a single step the 
probability of a displacement between jc and x + dx is given by 

1 / (x - a) 2 \ 


P(x)dx = 


V2 


na z 


exp 


2cr 2 


After N steps the displacement of the walker is S = X\ -f h X N , where X, is the 

displacement during the ith step. Assume the steps are independent of one another. After 
N steps, (a) what is the probability density for the displacement, S, of the walker and (b) 
what are the average displacement, (5), and the standard deviation of the walker? 

Problem 4.14. Consider a random walk in one dimension for which the walker at each 
step is equally likely to take a step with displacement anywhere in the interval 
d - a^x^d + a, where a < d. Each step is independent of the others. After N steps the 

walker’s displacement is S = Xi 1- X N , where X, is the displacement during the ith 

step. After N steps, (a) what is the average displacement, ( S ), and (b) what is his 
standard deviation? 


Problem 4.15. Consider a random walk for which the probability of taking of step of 
length, x — * x + dx, is given by P{x)dx = £ (a/fx 2 + a 2 ))dx. Find the probability density 
for the displacement of the walker after N steps. Does it satisfy the Central Limit 
Theorem? Should it? 


Problem S4.1. Consider a random walk on an infinite one dimensional (d = 1 ) lattice 
where the walker starts at site / = 0. (a) Compute the generating functions U(z, l) and 
V(z, /)• (b) Compute the probability to reach site, /, during the random walk, (c) 
Compute the probability that the walker reaches site l — 0 and site / = 1 after s = 3 and 
after s = 4 steps. Compute the probability that the walker reaches sites 1 = 0 and site 
/ = 1 for the first time after s = 3 and after 5 = 4 steps, (d) Compute the average number 
of steps needed to reach site, 1=1, during the random walk. Explain your result. 

Problem S4.2. Consider a random walk along nearest neighbors on an infinite, periodic 
face centered two-dimensional (d = 2) square lattice. A unit cell is shown in Fig. 4.19. 
Assume that the lattice spacing is a = 2 and that the walker starts at site l\ = 0, l 2 = 0. 
(a) By counting paths (draw them), find the probability, F 4 (0), that the walker returns to 
the origin after four steps. Find the probability, 24(0), that the walker returns to the 
origin for the first time after four steps, (b) Compute the generating function, U(z, 0). 
Use this result to compute P 4 (0) and g 4 ( 0). (c) Compute the escape probability. 

Problem S4.3. Consider a random walk along nearest neighbors on an infinite, periodic 
body-centered three-dimensional (d = 3) cubic lattice. A unit cell is shown in Fig. 4.20. 
Assume that the lattice spacing is a = 2 and that the walker starts at site, 



a 


Fig. 4.19. Unit cell for a face-centered square lattice. 
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Fig. 4.20. Unit cell for a body-centered cubic lattice. 


(/i = 0,/2 = 0 , h = 0). Compute the generating function. f/(l,0), and the escape 
probability. [Hint. Make a change of variables, x, = tan(</>,/2). Then change to 
spherical coordinates, jci = rsin( 0 )cos(<^),;c 2 = rsin(0)sin(0), and jci = rcos(0). 
Replace the integration over r by an integration over t, where t = 

rsin (6) \J \ cos(i9) sin (|) . The three integrations over t,6, and <j> then separate.] 

Problem S4.4. Consider a Rayleigh-Pearson random walk in which the walker has a 
probability P{r)dr — rdr/(\ + r 2 ) 3 ^ 2 to take a step of length r — ► r + dr. If the walker 
starts at the origin, compute the probability Pn(R) to find the walker within a circle of 
radius R after N steps. 

Problem S4.5. A stochastic variable X has characteristic function f x (k) = 1/(1 + k 2 ). 
(a) Compute the probability density, Px{x), the average, (x), and variance, (b) Write 
Kolmogorov’s formula for this characteristic function. What is K(u)l Is the stochastic 
variable X infinitely divisible? 

Problem S4.6. Consider the gamma distribution function for stochastic variable X: 


F(x) = 



n/2 | <.* 

iW2)Jo 


d yyM )- 1«->/2 


for x > 0 


and F(x) = 0 for x < 0 (n is an integer), (a) Find the characteristic function for this 
distribution, (b) Write the Levy-Khintchine formula for the characteristic function. That 
is, find a and G(x). Is X infinitely divisible? [Hint: First find F N (x) and use it to find 
G n (x) and take the limit.] 
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STOCHASTIC DYNAMICS AND 
BROWNIAN MOTION 


S.A. INTRODUCTION 

Now that we have reviewed some of the basic ideas of probability theory (cf. 
Chapter 4), we will begin to study more complicated situations in which 
probability can evolve in time. In Chapter 4 we already considered some simple 
random walks which are elementary stochastic processes that can be built from 
repeated application of the same transition process at discrete time intervals. In 
this chapter we will derive differential equations of motion for the evolution of 
probability distributions and densities in time. The level at which we discuss 
this time evolution is phenomenological because we will not concern ourselves 
with the relation of these equations to Newtonian mechanics. That will be the 
subject of Chapter 6. Also, in this chapter we will limit ourselves to Markov 
processes, which are stochastic processes with a very limited memory of 
previous events. 

The equation which governs the stochastic dynamics of Markov processes is 
the master equation. It is one of the most important equations in statistical 
physics because of its almost universal applicability. It has been applied to 
problems in chemistry, biology, population dynamics, laser physics, Brownian 
motion, fluids, and semiconductors, to name a few cases. As a system of 
stochastic variables evolves in time, transitions occur between various 
realizations of the stochastic variables. Because of these transitions, the 
probability of finding the system in a given state changes until the system 
reaches a final steady state in which transitions cannot cause further changes in 
the probability distribution (it can happen that the system never reaches a steady 
state, but we will be most concerned with cases for which it can). To derive the 
master equation, we must assume that the probability of each transition depends 
only on the preceding step and not on any previous history. This assumption 
applies to many different types of physical system found in nature. 

One of the simplest types of Markov processes that we will consider is the 
Markov chain. Markov chains are processes in which transitions occur between 
realizations of discrete stochastic variables at discrete times, as in the case for 




230 


STOCHASTIC DYNAMICS AND BROWNIAN MOTION 


the random walks we treated in Chapter 4 using simple probability theory. In 
this chapter we will formulate such problems in a more elegant manner. We will 
get a very good picture of how stochastic systems can decay to a unique steady 
state for the case in which the transition matrix is “regular” and we can 
introduce the concept of ergodicity, which we shall come back to in Chapter 6. 
The dynamics of Markov chains is totally governed by a “transition matrix” 
which is real and generally is nonsymmetric. We shall show how the stochastic 
dynamics can be expressed in terms of a spectral decomposition of the 
probability in terms of the left and right eigenvectors of the transition matrix. 

If the time interval between events can vary in a continuous manner, then we 
must introduce the master equation which is a differential difference equation 
governing the time evolution of the probability. For the very special case when 
the transition rate between realizations exhibits detailed balance, solutions of 
the master equation can also be expressed in terms of a spectral decomposition. 
Detailed balance holds for many types of systems which are near thermal 
equilibrium or do not have internal probability currents as they reach the long 
time state. Some examples for which detailed balance holds can include 
chemical reactions near equilibrium, electronic transitions, and some random 
walks. The master equation for birth-death processes, which include chemical 
reactions and population dynamics, is derived in the section on special topics. 

One often encounters systems with many degrees of freedom but for which a 
few of the degrees of freedom evolve on a much slower time scale than the 
others. An example of this is Brownian motion, in which a relatively massive 
particle, such as a grain of pollen, is immersed in a fluid, such as water. The 
grain of pollen is observed to undergo rapid, apparently random, agitated 
motion. Brownian motion, which was first made popular by the work of 
biologist Robert Brown, was used by Einstein as evidence of the atomic nature 
of matter. Indeed, it only occurs because of the discrete particulate nature of 
matter. A phenomenological theory of this motion can be obtained by writing 
Newton’s equation of motion for the massive particle and including in it a 
systematic friction force and a random force to mimic the effects of the many 
degrees of freedom of the fluid in which the massive particle is immersed. This 
gives rise to the Langevin equation of motion for the massive particle. 

Given a Langevin equation for a Brownian motion process, we can obtain an 
equation for time evolution of the probability distribution in phase space of the 
Brownian particle, called the Fokker-Planck equation. In the sections devoted 
to special topics we will derive the Fokker-Planck equation and we will solve it 
for Brownian motion with one spatial degree of freedom in the presence of 
strong friction. We shall also mention the interesting connection of Fokker- 
Planck dynamics to chaos theory when one considers more than one spatial 
degree of freedom. For cases in which the master equation cannot be solved 
exactly, it can sometimes be approximated by a Fokker-Planck equation. We 
will discuss some of those cases in this chapter. In later chapters we will use the 
concepts developed here to describe other types of Brownian motion in many 
body systems. 
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5.B. GENERAL THEORY [l^t] 

Let us consider a system whose properties can be described in terms of a single 
stochastic variable Y. Y could denote the velocity of a Brownian particle, the 
number of particles in a box, or the number of people in a queue, to name a few 
of many possibilities. 

We will use the following notation for the probability density for the 
stochastic variable Y: 

P\(yi, t\) = (the probability density that the stochastic 

variable Y has value yi at time fj); (5.1) 

Pi(yu h\yit h) = (the joint probability density that the 
stochastic variable Y has value y\ at 
time t\ and y 2 at time * 2 ); (5.2) 

P„(yi, t\\y 2 , t 2 \ . . . \yn,t n ) = (the joint probability density that the 

stochastic variable Y has value y\ at 

time *i,y 2 at time t 2 , . . . ,y„ at time t n . (5.3) 

The joint probability densities are positive: 

P n > 0; (5.4) 

they can be reduced: 

r 

j^n(yi, h;y 2 , t 2 ,;...;y n ,t n )dy n = P n -i(yi,h;y2, t 2 \ . . . ;y„_i, (5.5) 

and they are normalized: 

\pdyu ‘t )dyi = 1 . (5.6) 

In Eqs. (5.5) and (5.6) we have assumed that Y is a continuous stochastic 
variable. However, if Y is discrete we simply replace the integrations by 
summations. We can introduce time-dependent moments of the stochastic 
variables, (yi(h)y 2 (t 2 ) x • • • x y n (t„)). They are defined as 

* 

{ yi(h)y 2 (ti)---yn(tn )) = yw ■ -ynPniyu h-,...\y n ,t n )dy l ---y n ( 5 . 7 ) 

and give the correlation between values of the stochastic variable at different 
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times. A process is called stationary if 

Pn{yu hiyi, t 2 -,...]y n ,t„) — P n {yu h + r; y 2t t 2 + t; . . . ;y„,t n + t) (5.8) 

for all n and r. Thus, for a stationary process 

Pi(yuh) = P l (y l ), (5.9) 

and (yi(*i)y 2 (* 2 )) depends only on \t x — f 2 1 — the absolute value of the 
difference in times. All physical processes in equilibrium are stationary. 

We shall also introduce a conditional probability: 

]°i I i(yi, t\ | y 2 , t 2 ) = (the conditional probability density for the 

stochastic variable Y to have value y 2 at time (5.10) 
t 2 given that it had value yi at time t \ ) 

It is defined by the identity 

P\(yu h)P\\\{y\, ti\yi, h) =Pi{yu h\y 2 , t 2 ). (5.11) 

Combining Eqs. (5.5) and (5.11) we obtain the following relation between the 
probability densities at different times: 

P\(yi, h) = |/ > i(yi, /i)i®i|i(yi, h\y 2 , t 2 )dy x (5.12) 

where the conditional probability P\\\{yi, h\y 2 , t 2 ) has the property 

Pi\i{yu h\y 2 , t 2 )dy 2 = l, (5.13) 

as can be demonstrated easily. 

We can also introduce a joint conditional probability density as follows: 

Pk\i(yu h; - ;y*,f*b*+i,&+i, ; • • • ; ?*+/>**+/) 

= (the joint conditional probability density that the stochastic variable 
Y has values ( y* + i , t k +i ; . . . ; yic+i, tk+i ) given that ( y x , h 5 • . • 5 h) 
are fixed). (5.14) 

The joint conditional probability density is defined as 


Pk\i(yu h \ • • • \yk,tk\ yk+\ , h+i ; • • ; h+i) 

Pk+ i(yi, ti-,...]yk,tk]yk+htk+u---',yk+ht k+ i) - 
Pk(y 1, h;...;y k ,t k ) 


( 5 . 15 ) 
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The joint probability densities are important when there are correlations 
between values of the stochastic variable at different times — that is, if the 
stochastic variable has some memory of its past. However, if the stochastic 
variable has memory only of its immediate past, then the expressions for the 
joint probability densities and the joint probability densities simplify 
considerably. 

If the stochastic variable has memory only of its immediate past, the joint 
conditional probability density P„_ i j i ( yi , fj ; . . . ; y„-i ,t n -i\y n , t n ), where 
t\ < t 2 < • • ■ < t n , must have the form 

P„-m(yi, ft; • • • \yn- 1 ) tn- 1 |y«, t n ) = 1 1 ( i > t n -\ |y n , t n ). (5.16) 

That is, the conditional probability density for y n at t n is fully determined by the 
value of y„_i at t n - \ and is not affected by any knowledge of the stochastic 
variable Y at earlier times. The conditional probability density 
P, I , ( _V! , t\ |y 2 , t2) is called the transition probability. A process for which 
Eq (5.16) is satisfied is called a Markov process. A Markov process is fully 
determined by the two functions P\(y,t) and Pi|i(yi, ti|y 2 , * 2 )- The whole 
hierarchy of probability densities can be constructed from them. For example, 

/Myi, h;y 2 , t 2 ;y 3 , t 3 ) = P 2 (y 1 , h;y 2 , t 2 )P 2l i{y u h;y 2 , t 2 \y 3 , t 3 ) 

= P\(yu h)P\\\{yu h\y 2 , r 2 )Pi|i(y2, t 2 \y 3 , t 3 ). 

(5.17) 


If we integrate Eq. (5.17) over y 2 assuming t\ < t 2 < t 3 , we obtain 


Pi{y\, tv,y 3 , t 3 ) = Pi(yi, t \ ) 


Pl|l(yi, 'i !)’2, <2)/ ) lil(>’2, <2|.V3, h)dy 2 . 


(5.18) 


If we now divide Eq. (5.18) by Pi(yi, ti), we obtain 

^i|i(yi, ? 1 1 y 3 , t 3 ) = J P\\i{y\, h\y 2 , t 2 )P^i{y 2 , t 2 \y 3 , t 3 )dy 2 . (5.19) 

Equation (5.19) is called the Chapman-Kolmogorov equation. Notice that we 
have broken the probability of transition from yj, t\ to y 3 , t 3 into a process 
involving two successive steps, first from yi, t\, to y 2 , t 2 and then from y 2 , t 2 to 
y 3 , t 3 . The Markov character is exhibited by the fact that the probability of the 
two successive steps is the product of the probability of the individual steps. 
The successive steps are statistically independent. The probability of the 
transition y 2 , t 2 — > y 3 , t 3 is not affected by the fact that it was preceded by a 
transition yi, t\ —> y 2 , t 2 . 

In the next section we will use these equations to study some of the simplest 
Markov processes, namely those of Markov chains. 
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5.C. MARKOV CHAINS [2-6] 

One of the simplest examples of a Markov process is that of a Markov chain. 
This involves transitions, at discrete times, between values of a discrete 
stochastic variable, Y. Let us assume that Y has realizations {y(n)}, where 
n = 1,2, . . . ,M, and that transitions occur at times t = sr, where 
£ = 0 , 1 , ... ,oo. We let P{n,s) denote the probability that Y has realization, 
y(n), at “time” t = s. We let Pi|i(«i, s\ j« 2 , £2) denote the conditional 
probability that Y has realization yk) at time S 2 , given that it had realization 
y(«i) at time s\. The two quantities P(n,s ) and 1 1 («i, -Si |«2, ^2) completely 
determine the evolution of a Markov chain. 

We can write Eq. (5.12) for the probability P(n, s ) in the form 

M 

P(n, s + 1) = E'K s)Px\i(m, s\n, s + 1) (5.20) 

m=l 

and from the Chapman-Kolmogorov equation, (5.19), we can write the 
conditional probability in the form 

M 

P\\i(n 0 , sok s+ 1) = 5^Pi|i(«o, s 0 \m, s)P l \i{m , s\n, s + 1). (5.21) 

m—l 

The quantity P\\i(m, s|/i, s + 1) is called the transition probability. It is the 
conditional probability that if the system is in state m at time s, it will jump to 
state n in the next step. It therefore contains all necessary information about the 
basic transition mechanism in the system. 

Let us now introduce the transition matrix Q(s), whose (m, n)th element is 
the transition probability 

Qm,n(s) =Pni(m, s\n, s + 1). (5.22) 

In this section we will consider Markov chains for which the transition matrix is 
independent of time, Q(j) = Q. In Section S5.A, we consider the case when the 
transition matrix depends periodically on time Q(s) = Q (s + N). 

5.C.I. Spectral Properties 

For the case of a time-independent transition matrix, we have 

Qm,n =Pi|iK Ok 1) =Fi|i(m, s\n, s+ 1). (5.23) 

The formal solution of Eq. (5.21) for the conditional probability can be obtained 
by iteration and is given by 

P\\\{m, 5 0 |/t, s) = {Q s ~ So ) m , n , (5.24) 
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where the right-hand side denotes the (m, n ) th element of the matrix Q raised to 
the s - so power. The probability P(n, s) is given by 

M 

P(n, s) = Y,P(m, 0)(Q S )„,„- (5.25) 

m= 1 

It is useful at this point to introduce Dirac vector notation. We will let 
P(n , 5 ) = (p(s) |«), and P\\\(m, sojw, s) = (m|P(i'o|5') |n). Here (p(^) | is the 
probability vector and P(so |s) is the conditional probability matrix. The left and 
right states, (n | and | n) respectively, denote the possible realizations of the 
stochastic variable Y and are assumed to be complete, J2n=\ l n )( n l = 1, and 
orthonormal, ( m\n ) = The probability P(n, s ) = (p(s) |n) can be thought 
of as the nth component of a row vector. We shall now express P(so | s) and 
(p(s) | in terms of eigenstates of the transition matrix, Q. 

The transition matrix, Q, in general is not a symmetric matrix. Therefore, the 
right and left eigenvectors of Q will be different. The eigenvalues, A, 
(i = 1, ... ,M), of Q are given by values of A which satisfy the condition that 
the determinant of the matrix, Q- AI, be zero. That is, 

det | Q — AI ( = 0 (5.26) 

(I is the unit matrix). If Q is an M x M matrix, it will have M eigenvalues, 
which may or may not all be real. Corresponding to the ith eigenvalue, there 
will be a left eigenstate, (x» | » and a right eigenstate, |t />,). The left eigenstate 
satisfies the eigenvalue equation, (x, | A ,• = (x/ 1 Q, or 

M 

Xi(n)Xi = ^2 Xi{m)Qm,n, (5.27) 

m= 1 

where x«( n ) = (Xi I n ) and Q m n = (m\Q\n). The right eigenstate satisfies the 
eigenvalue equation, A, |^,) = Q|^,), or 

M 

A i^i(n) = ^2 (5.28) 

m= 1 


where ^,(n) = (n\^i). 

We can prove that the left and right eigenvectors of Q are complete and 
orthogonal. Let us first prove orthogonality. Consider the eigenvalue equations 
'VW = Q I '0;) and (x,|A; = (x/|Q. Multiply the first equation by (x,| and 
the second equation by \ifjj), and subtract the first equation from the second. 
We get 


(•V - = (xi I Q i 4’i) - (xiIQIV’j) = o. 


(5.29) 
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Thus, if A i ± A j, then {%,- 1 ipj) = 0. If A, = Ay, then (xi \ ^i) can be nonzero. We 
shall always normalize the left and right eigenvectors so that 

(XiWj) = hj- (5.30) 

Let us next consider completeness. Let us expand a probability vector (p | , in 
terms of left eigenvectors so that (p| = Y^f=\ a i(Xi I • We then multiply on the 
right by | ipj) and use the orthonormality condition, Eq. (5.30). We find 
otj = (p | ^}j) so that 

M 

<p| = 5Z(p|^)(xil 

i=l 

This expresses the completeness condition 

M 

EW<X(|=I (5.31) 

i=l 

for the left and right eigenstates. 

We next obtain some general properties of the eigenvalues, A,. First, 
we will prove that | A,- 1 < 1 for all i. To do this, first take the absolute value 
of Eq. (5.28), | A,^ f (n) | = | Qn,m^i{m) | • Both A,- and ipi(n) are 
complex (or real) numbers while Q m n is a real number. It is easy to show 
that | A,"0,(n) | = | A/| |t/>,(n) |. We next use the triangle inequality, 

I i x i I ^ Zti \ x i I , where Xj is a complex number. Thus, we find 

M 

|A i ||W«)K£e», m |V’ i MI- (5-32) 

m=l 

Let us assume that | ifti(n) | for all n, where C is a positive constant. Then 

M M 

^TQn,m\i)i{ m )\ ^C^2Qn,m = C. ( 5 - 33 ) 

m = 1 m = 1 

Assume that for n = no, \ipi(no)\ = C. Then Eqs. (5.32) and (5.33) give 
| A,|C^C and 

| A,-| < 1. (5-34) 

Thus, all eigenvalues of the transition matrix have a magnitude less than or 
equal to one. 

Now that we have established that | A, j < 1, we can also prove that A, = 1 ,s 
always an eigenvalue. Let us first note that the particular choice, i>i(n) = 1 f° r 
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all n, is a right eigenvector with eigenvalue, A, = 1 [cf. Eq. (5.28) and note that 
H.iQ nm — 1]. The corresponding left eigenvector, {x;|, with eigenvalue, 
Aj = 1, must satisfy the eigenvalue equation 

M 

Xi{n) = ^2 Xi{m)Qm,n- (5.35) 

m= 1 

Equation (5.35) is the equation for a stationary probability vector, 
Xi{n) = Psr{n ) — that is, one that does not change with time. 

We can expand the transition matrix, Q, in terms of its left and right 
eigenvectors. Consider the left eigenvalue equation, (x, | A, = (xi | Q. Multiply 
from the left by the right eigenstate, | -0,) , and sum over i. If we use the 
completeness relation, Eq. (5.31), we obtain Q = Y2L\ A/ 1 V>j) (Xi 1 or 

M 

Qm,n = ^2x i ip i (m)xi(n). (5.36) 

i=l 

If we now combine Eqs. (5.24) and (5.36) and use the orthonormality of the left 
and right eigenvectors, we obtain 


M 

Pi|i(m, s 0 |ft, s) = '^2\ s ~ s °'tp i (m)xi(n). (5.37) 

i= 1 

Thus, we have obtained the general solution for the discrete Markov process in 
terms of the eigenvalues and left and right eigenvectors of the transition matrix 

Q 

The detailed behavior of the conditional probability depends on the structure 
of the transition matrix. There is one case that we are particularly interested in, 
and that is the case when the transition matrix is regular. The transition 
matrix Q is called regular if all elements of some power (N an integer) are 
nonzero. If Q is regular, the probability P{n , 5 ) tends to a unique stationary 
state Psr{n) after a long time. This means that there will be only one eigenvalue 
of Q with eigenvalue A = 1. Let us denote it Aj = 1. Then Eq. (5.37) can be 
written 


M 

P \\\{™ , so | «, s) = i)\{m)P ST (n) + ^2\ s i - s °ip i (m)xi{n), (5.38) 

i= 2 

where ^ (m) = 1 for all m. Since A, < 1 for 1 ^ 1, we can take the limit 5 — ► 00 
and obtain 


lim Pi 1 1 (m, s 0 1 ft, s) = i>i {1 m)Psr{n ) with ^ (m) = 1 for all m. (5.39) 

S—+OC v ' 
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Thus, for regular transition matrices, the conditional probability tends to a 
unique value for long times. Similarly, 

lim P(n, s ) = Psr{n). (5.40) 

s — ►OO 

In Exercise 5.1, we shall illustrate these ideas with an example. 

Markov chains which are governed by regular transition matrices are 
ergodic. That is, it is possible, starting from any one realization of the stochastic 
variable, to reach every other realization through transitions during the 
evolution of the system. This is no longer true if, for example, the transition 
matrix is block-diagonal. Then some realizations will be decoupled from others. 
When the transition matrix is block-diagonal, the system can have multiple 
long-time states. 


■ EXERCISE 5.1. Consider two pots, A and B, with three red balls and 
two white balls distributed between them so that A always has two balls and 
B always has three balls. There are three different configurations for the pots, 
as shown in the figure below. We obtain transitions between these three 
configurations by picking a ball out of A at random and one out of B at 
random and interchanging them, (a) Compute the transition matrix Q and 
the conditional probability so\n, s). (b) If initially P,(l, 0) = 1, 

Pi (2,0) = 0, and Pi (3, 0) = 0, compute the probability Pi(n, s) (n = 1,2, 
and 3) at time s. (c) Assume that the realization, y(n) = n. Compute the first 
moment, (y(s)), and the autocorrelation function, (y(0)y(j)), for the same 
initial conditions as in part (b). 


■® 


® 


®. 


® 








A B 

1 J 

yi 


A B 

I I 

y* 


A 

t_ 


B 

\ 

Yi 


Answer: 

(a) Inspection shows that we can make the following transitions from yi 
to yj with transition probability, Qy, such that <2n =0, Qn = 
2i3 = o, 221 = h 222 = 5, 223 = 5, 231 = 0, 232 = | and 233 = 5' 
The transition matrix and its square are given by 

0 1 0 \ 

ill 

6 2 3 

0 i \) 

respectively. The transition matrix is regular so this system has a 



and Q 2 = 
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unique long-time stationary state. The eigenvalues of Q are Aj = 1, 
A 2 = g, and A 3 = - 5 . The right eigenstates can be written 


l<M = 



1 ^ 2 ) = 



1 ^ 3 ) = 



The left eigenstates can be written 

(Xil = (ft ft ft)> (X2I = (“ft ~ft ft)j 

<Xsl = (5 -5 s)- 

We can now construct the conditional probability matrix 

_l_ A J_\ 

10 10 10 \ 

v(s 0 \s) =Y t K~ s °m(xi 


i=l 


+ G) 


ns-Jo 


+ (~l) 


1\5--S0 



Note that all the probabiliy is carried by the long-time state. 

(b) Denote the probability vector as (p($) | = (P(l, .s), P( 2, 5 ), P( 3, s)). 
Then (p(s) | = (p(0) j P(0 1 ^). If we assume an initial condition 
(p(0) | = (1,0,0), then the probability vector at time s is 


(pWI = 


r i 

10 

3_ 

To 


2 /1 


1 


+ 5 U 1 + ~ 1_ " 


3 2 /I 

5 + 5 V6 1 


mm- 


(c) The first moment is given by (y(,s)) = Y^ n =i nP \{ n i s )• From the 
expression for (p(j) | we obtain 

11 6 (\ 


(y(5))= T"5U 

Finally, we compute the autocorrelation function, 

3 3 

(y( 0 )y(s)) = °) p i | i(w, 0 | n, s). 

m= 1 n= 1 
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We obtain 


MOM*)) = y + ^ 



s 


Note that the eigenvalue A 3 = — \ does not contribute to (y(s)) and 
(y(O)y(s)). However, if we choose the dependence of y(n ) on n 
differently, it will, in general, contribute to these. 


5.C.2. Random Walk 

The problem of random walk on a lattice which was considered in Sections 4.E 
and 4.F can be formulated as a Markov chain of transition probabilities. For 
simplicity, let us consider random walk on an infinite one dimensional lattice 
with lattice spacing, A, along the jc axis, and let us assume that the time 
between steps is r. Let Pi («A, st) be the probability to find the particle at point 
x = nA after s steps. Then 


00 

Pi(nA, (s+ 1)t) = ^ P\(m A, st)P\ \ i(m A, st \ n A, (s + l)r), (5.41) 

m~- 00 


where P\\\{m A, sr\n A, ( s + l)r) is the transition probability to go from site 
x = m A to site x = n A in one step. 

As a specific example, let us consider the case in which the random walker 
has an equal probability to go one lattice site to the left or right during each 
step. Then the transition probability is 

P} 1 1 (w A, | n A, (5 + l)r) = 1 (5.42) 

and Eq. (5.41) takes the form 

Pi(«A, ( 5 + l)r) = \P\{{n+ l)A,sr) +\P\{{n - 1)A,st). (5-43) 

We can obtain a differential equation for the probability P\(n A,st) in the 
continuum limit. Let us subtract P\(n A,sr) from both sides of Eq. (5.43) and 
divide by r. We then can write 

Pi (« A, st + t) — Pi ( n A, st) 
t 

A 2 [Pi (nA + A, st) + Pi (m A - A, st) - 2 Pi ( n A, sr) ] 

= 2r j_ A^ J' 


(5.44) 
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If we now let x = n A, t = sr, and take the limit A — » 0 and r — > 0 so that 
O = A 2 /2 t is finite and x and t are finite, we obtain the following differential 
equation for P(x, t ) : 


dP\{x,t) d 2 P\{x,t) 

dt dx 2 . 


(5.45) 


(cf. Egs. (4.62)-(4.64)). Equation (5.45) is a diffusion equation for the 
probability density P\{x, t ). 

Let us solve Eq. (5.45) for the case Pi(jc,0) = S(x). We first introduce the 
Fourier transform of Pi (x, t ) : 


Pi (*, 0 = 



dxPt(x, t)e ikx . 


Then Eq. (5.45) takes the form 

^M = -Djfc 2 P,( M ). (5.47) 

We can solve Eq. (5.47) to obtain 

P{k, t ) = <r“\ (5.48) 

where we have used the fact that P(k, 0) = 1 since Pi(x, 0) = 6(x). We can 
now take the inverse transform to obtain 

= lL dkeVkh = (549) 

It is easy to show that (x(t)) = 0 and (x 2 (t)) = 2 Dt. 

5.D. THE MASTER EQUATION [3, 4] 

We often need to consider processes for which the time interval between events 
can vary in a continuous and random manner, but as in Section 6.C the 
realizations of the stochastic variable Y are discrete. For such processes, we 
need to obtain a differential equation for the time dependence of the probability 
Pi(n, t). In this section we consider processes for which the elementary events 
or transitions occur over a very short time interval compared to the time during 
which the stochastic process evolves. The time evolution of such processes is 
governed by the master equation. 
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5.D.I. Derivation of the Master Equation 

To begin, let us rewrite Eq. (5.12) in the form 

M 

Pi(n, t + At) - Pi (m, t)Pi |i(m, t\n, t + At). (5.50) 

m= 1 

The differential equation for Pi (n, t) can be constructed from Eq. (5.50) if we 
note that 

dP\{n, t ) _ lim / P i (n, t + At) -Pi(n, r) \ 
dt ~ At—o \ At ) 

j m (5.51) 

= 0(^1 1 i( w » t\n, t + At) -6 m , n ). 

1 m= 1 

Since we will take the limit At — > 0, we can expand the transition probability 
Pi|i(w, t\n, t + At) in a power series in At and keep only the lowest- 
order term. In order to conserve probability at all times, its most general 
form is 


^*1 1 1 t\ n, t + At) — 6 m ; n 


M 


1 - Af^Hy/(t) 
i=i 


+ w m ,n(t)At + . . . , (5.52) 


where w m „(r) is the transition probability rate. In Eq. (5.52), w m , n {t)At is the 
probability of a transition from state m to state n during the time interval 
t — * t + At. Similarly, [1 — Wm,i{t)At\ is the probability that no transition 
occurs during the time interval t — > t + At. If we now substitute Eq. (5.52) into 
Eq. (5.51), we obtain 

dP (n t) . 

X ~Qf~ = - Pl(«, 0 W «,»(0)]* ( 5 ‘ 53 ) 

m= 1 


Equation (5.53) is called the master equation. The master equation gives the 
rate of change of the probability P\(n, t) due to transitions into the state n from 
all others states (first term on the right) and due to transitions out of state n into 
all others states (second term on the right). 

The conditional probability Pi\i(no, 0|n, t) also satisfies a master equation 

dP\\\{nty, 0\n, t) = 0 | Wj t)w m , n (t) -Pi|i(n 0 , 0|n, t)w n>m (t))], 

m= 1 

(5.54) 
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where Pi | \ (no, 0 1 n, t) is the probability to find the system in the state n at time 
t, given that it was in the state no at time t = 0. The conditional probability 
satisfies an initial condition /^(no, 0|n, 0) = <5„ jno . 

The master equation can be written in a more concise form if we introduce 
the transition matrix 


M 


W m , n (t) = w mi „(f) - 8 m>n 

n'= 1 

The master equation then takes the form 

dPi(n,t ) A , x 

V =E p '( m ' owwo. 


m=l 


(5.55) 


(5.56) 


From Eq. (5.55), we see that the transition matrix must satisfy the conditions 
W mi n^ 0 for n^m and ^ W mtfl = 0 for each m. (5.57) 

n 


Thus, the entries in the rows of W m) n must add to zero. 

We can write Eq. (5.56) in still more concise form if we introduce Dirac 
notation. We let P\ (n, t) = (p(t) | n), where (p(t) | is the probability vector, and 
Pi | i (no, to | «, t) = (no | P(to 10 |«), where P(fo 1 1) is the conditional probability 
operator. Similarly we let W mi „(t) = (m) W(t) jn). The master equation for the 
probability vector becomes 


^=(p(t)|W W> (5.58) 

and the master equation for the conditional probability operator becomes 

— = P(0|<)W(r), (5.59) 

The transition matrix, W m n {t), in general is not symmetric so its left and right 
eigenvectors will be different. However, one can often use the method of 
Section 5.C.1 to obtain a spectral decomposition of W(f) and therefore of 
(p(t) j and P(0|f). However, care must be used. There may be cases where the 
eigenvectors of W(f) do not span the solution space [7]. Then the spectral 
decomposition cannot be used. There is one type of system for which a spectral 
decomposition can always be done, and that is the case for which the transition 
rates w mjfl (t) satisfy detailed balance. 
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5.D.2. Detailed Balance [3, 4, 8] 

The transition rates satisfy detailed balance if 

P s {n)w n , m = P s (m)w Wi „, (5.60) 


where P s (n ) = lim^ooPi (n, t) is the long-time stationary probability of the 
system (we assume the transition rates, w m „, are independent of time) and is 
independent of time. P s ^ is the left eigenvector of the transition matrix, W, so 
that (p s |W = 0 where P s (n ) = (p s |n). Equation (5.60) tells us that at equili- 
brium, the flow of probability into level n from level m is equal to the flow of 
probability from level m to level n. It is useful to note that the state, P s (n), can 
be obtained from Eq. (5.60) by iterating. For example, P s ( 2) = 
^ S (l)(wi,2/W2,l), P S ( 3) = P S (2)(\V 2 , 3/w 3)2 ) = P S { I)(wi |2 /W2,l) (w^, 3 /w 3j 2 ) 
and so on. P s (l) can then be found by requiring that the probability be 
normalized to one, Yln=\ ^ s ^ =1 ‘ 

Given Eq. (5.60), we can show that the dynamical evolution of the master 
equation is governed by a symmetric matrix. Let us define 


V n ,m = 


]lAJ) Wn ’ m ~ \P°il) Wn ’ m Sn ’ m ^ w "’"' 

jp s (m) c ^ _ T , 

\ n c / \Wm,n * n,m / W m n' — 'ffi, 

V p W V 


(5.61) 


where we have use d Eq. (5.60). If we now introduce a new function, 
P(n, t) = P\(n, t)/y/P s (n), the master equation takes the form 


dP(n, t) 
dt 


M 


^P(m, t)V m>n 

m= 1 


(5.62) 


Let us again use Dirac notation to write (p(f) | n) = P(n, t) and V n , m — 
(n\\\m). Then the solution of the master equation becomes 

<P(0 1 = <P(0)|e v '. (5-63) 

Since V n , m is a symmetric matrix, it has a complete orthonormal set oi 
eigenvectors. We shall denote the eigenvalues of V by A, and left and right 
eigenvectors by (ipi \ and \ifti), respectively, where i = 0, . . . ,M — 1. The lelt 
and right eigenvectors are the same. Thus, (ipt | V = (ipt | A, and V | ipi) = A, | ipi)- 
Since the eigenvectors form a complete orthonormal set, we can write 
orthonormality condition as (ipi \ ipj) = 8 t j and the completeness condition as 

= i- 
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The solution to the master equation can now be written in terms of a spectral 
decomposition 


(p(OI = ^2(p(0)\'fpi)e Xit ('ipi\. (5.64) 

i =0 

In terms of the probability Pi (n, t), it takes the form 

M - 1 M fp77~\ 

Pi(n, t) = Q)(m\ipi)e Xit (ipi\n). (5.65) 

[=0 m= 1 V ' ' 

The eigenvalues A, must be negative or zero. Let i = 0 denote the zero 
eigenvalue, Ao = 0. Then 

P s (n) = lim Pi (n, t) = \ ^7r\ p i ( m > °) ( m I ‘M ( Vto I n ) ■ (5-66) 

^°° v p ( m ) 

In order to be consistent we must have (m j -0o) = (ip o I w ) = y/P s ( m ), for 
since Pi ( m > 0) = 1 • Thus, 


M - 1 Af 


Pi(n, t) = P s (n ) 4- \ °)H^«> V (^>}- (5-67) 

/=1 m=l V 1 V / 


In Exercise 5.2 we give an example of a random walk which obeys detailed 
balance and in Exercise 5.3 we give an example of one that does not obey 
detailed balance. 


■ EXERCISE 5.2 Consider an asymmetric random walk on an open- 
[ ended lattice with four lattice sites. The transition rates are 
| vvi i2 = vv 4 , 3 = 1, vv 2 , 3 = w 3;4 = |,w 2) i = w 3)2 = |, and w/j = 0 for all 
other transitions, (a) Write the transition matrix, W, and show that this 
system obeys detailed balance, (b) Compute V and find its eigenvalues and 
eigenvectors, (c) Write P\(n, t) for the case Pi(n, 0) = <5„ i. What is 

; Pi (2,0? 


Answer: 


(a) The transition matrix, W, is given by 





1 

-1 

I 

4 

0 


0 

3 

4 

-1 

1 



w = 
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It has eigenvalues Ao = 0, Ai = — 1 + ^, A 2 = —2, and A 3 = 
— 1 — The left eigenvector of W with eigenvalue, Ao = 0, is the 
stationary probability distribution, (p 5 | = This s y stern 

satisfies detailed balance because P 5 (n)w„ n+ i = P s (n + l)w„+i„. For 
example, P s (l)w lj2 = (i)(l) = i and P s (2)w 2 ,i = (ft)® = ft. 

(b) The matrix V is given by 



- 1 i 

0 

0 


? - 1 

A 

0 

V = 

0 f 

4 

-1 

& 

2 


^ 0 0 

£ 

2 

-1 

eigenvalues Aq = 0, Ai 

= - 

1+^ 



(0.196,0.392,0.679,0.588), 

(tpi 1 « (0.679,0.588, -0.196, -0.392), 

(fa | ~ (-0.196,0.392,-0.679,0.588), 

and (^ 3 1 ~ (-0.679,0.588,0.196,-0.392). 

(c) For the initial condition, Pi (n, 0) = 8 n> 1 , Pi ( n , f) can be written 

P!(n, 0 = P(n) s + (! \il>i)e Xit {il)i\n). 


Using the numbers from (b) we find 

Pi (2, t ) a 0.154 + (0.799)e“°' 567/ - (0.154)e" 2/ - (0.799)<r 1433 '. 
Note that Pi (2, 0) = 0 as it should. 


■ EXERCISE 5.3 Consider an asymmetric random walk on a periodic 
lattice with four lattice sites. The transition rates are vvi j2 = 3 = 

W 3 4 = W 41 = |, w 2) i = W 3 )2 = W 4 t 3 = wi ; 4 = and w,j = 0 for all other 
transitions. Write the transition matrix, W, and show that this system does 
not obey detailed balance. 

Answer: 


The transition matrix, W, is given by 


W = 


/-1 

1 
4 
0 

V ? 


a 
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The left eigenvector of W with eigenvalue, Ao = 0, is the stationary | 
probability distribution, (p s | =(5,^,5,^)- Therefore, detailed balance 
cannot be satisfied and we cannot find a symmetric matrix which governs 
the dynamics of this system. It is interesting that the long-time state is one 
for which it is equally likely to find the random walker at any lattice site. Yet 
because detailed balance is not satisfied, there is also a probability current 
around the lattice. 


5.D.3. Mean First Passage Time [9] 

One of the useful quantities we can compute is a mean first passage time. If we 
consider the random walk problem, the mean first passage time is the average 
time for the walker to reach some site, n p , for the first time assuming that the 
walker started at site n 0 at time t = 0. Computing this quantity involves a trick. 
We will assume that Pi ( n p , t) = 0 for all time. This way, if the walker steps on 
site n p , it can never return to the random walk. This is called an absorbing 
boundary condition at site n p . 

To be clear about what we are doing, we will let Q n (t ) denote the conditional 
probability to find the walker at site n, assuming that it starts at site n 0 at time 
t = 0 (we suppress its dependence on no for simplicity). Then Q n ( 0) = and 
Qn p (t) = 0 (the absorbing boundary condition). The equations of motion for 
Q n {t ) are the same as for Pi (n, t) [cf. Eqs. (5.55) and (5.56)], except that we set 
Qn p {t ) = 0. Thus, 

QQ (f) Ml M 

"L = Qm(t)yv m ,n(t) - Q n (t) wy„(t)) for n 7^ n p . 

m(^n p )= 1 m= 1 

(5.68) 

Note that we set Q np (f ) = 0 but w nPtn 7^ 0 in Eq. (5.68). In Dirac notation we 
write Eq. (5.68) as 


0Q(O 

dt 


= QM, 


(5.69) 


where Q n (t) = (no|Q(f)|n) and (m|M|n) = w W) „ - <5 m) „ w n>n > with 
m 7^ n p . The matrix M is not symmetric in general, and therefore its left and 
right eigenvectors will not be the same. We can use the methods of Section 5.C 
to expand the solution of Eq. (5.69) in terms of left and right eigenvectors of M 
and its eigenvalues (see Exericse 5.4). However, when using this approach one 
must check that the eigenvectors are complete. Since Q n (t ) must decay, at least 
for a finite lattice (the walker eventually steps on site, n p ), all the eigenvalues of 
M must be negative. 
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The probability that the walker is still “alive” (i.e., has not stepped on the 
site, n = n p ) at time t is 

M 

Pno(t) — Qn{t)- ( 5 . 70 ) 

«(A>) =1 

We now define 

fn 0 ,n P (t ) = {the probability to reach the site, n P , during 

the time interval t —* t + dt} (^ 71 ) 

We then note that to be “alive” at time t, the probability must equal the sum of 
the probabilities to either have “died” during the time t — > t + dt or still be 
alive at time t + dt. Thus, we can write 

Pno{t) =f no , np {t)dt + P no (t + dt). ( 5 . 72 ) 

If we now note that P no (t + dt) = P„ 0 (t ) + ( dP no {t)/dt)dt H , then we can 

write 

/«.,»,« = -^>. ( 5 . 73 ) 

From Eqs. ( 5 . 68 ) and ( 5 . 70 ), the probability density, f^ np {t), can also be 
written 


/*.*««- t 

nfrn P )= 1 

MM MM 

n'(^n P )= 1 m(^n P ) = l h'(^h/>)= 1 m=l 


= - J2 Qn>(t)Wn P ,n>{t))- 

n'(^n P )= 1 


( 5 . 74 ) 


Thus fn (h n P (t) can be expressed in terms of the probability to find the walker on 
neighbors of the site n P and the transition rate between n P and its neighbors. 

The average time it takes the walker to reach the site n p , given that it starts at 
site no at time t = 0, is 


• oo poo 

(t) = dt tf nQ ,n P (t) = dtPng^t), 

Jo Jo 


( 5 . 75 ) 
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where we have used Eq. (5.72) and have integrated by parts, (t) is the mean first 
passage time. 


■ EXERCISE 5.4 Consider an asymmetric random walk on a lattice with 
five lattice sites. Assume that the fifth site, P, absorbs the walker. The 
transition rates W 12 = W 13 = wi )4 = w\,p = W 2, 1 = W 2, 3 = w 2 , p = 

5,^1 = VV 3 i2 = W 3,4 =5, w 4 ,l = W 4j 3 = W 4 /> = Wp \ = Wp )2 = W/> 4 = 

| , and Wij = 0 for all other transitions, (a) Write the transition matrix, M, 
and compute its eigenvalues and left and right eigenvectors, (b) If the walker 
starts at site n = 3 at time t = 0, compute the mean first passage time. 

Answer: 


(a) The transition matrix, M, is given by 


It has eigenvalues Aj « —0.216, A 2 = —1.0, A 3 ~ —1.284, and A 4 = 
— 1.5. The left eigenvectors are 

(xtl ~ (0.286,0.204,0.265,0.204), 

(xfl =(0, -0.5,0, 0.5), 

<X 3 1 as (0.614,-0.054,-0.415,-0.054), 

<X 4 I — (—0.2, 0.3, —0.3, 0.3). 



The right eigenvectors are 

1.0 ^ 

0.950 
1.234 
0.950/ 


lx*) 


|X?> = 




( 10 ) 

t i 

/ —0.5 \ 

lx?) * 

-0.117 ! 

II 

1.0 

-0.901 j 

1 " L ° 


\ -0.117/ 

1 1 

\ 1.0 / 
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(b) The spectral decomposition of the matrix Q(t) can be written 

4 


Q« = Q(o)£y"lxf)(xf|. 


(i) 


i=l 


If we take matrix elements and note that (n 0 1 Q(0) jn) = <5„ no , we find 


(rto|Q(OI«) = ^2 eXit ( n o\xf)(xU n )- 


(2) 


i=i 


The mean first passage time is 


4 oo 

rr Jo 


dr(n o |Q(0l n ) 


1 .00 4 

dtPno(t) = J2 

o 

n= 1 i=l V A i / 


( 3 ) 


If we now let no = 3 and use the numbers from part (a), we find 
(t) « 5.33. 


5.E. BROWNIAN MOTION 

Brownian motion provides some of the most spectacular evidence, on the 
“macroscopic” scale, for the discrete or atomic nature of matter on the “micro- 
scopic” scale. The discreteness of matter causes fluctuations in the density of 
matter, which, in turn, causes observable effects on the motion of the Brownian 
particle. This can be seen if one immerses a large particle (usually about one 
micron in diameter) in a fluid with the same density as the particle. When 
viewed under a microscope, the large particle (the Brownian particle) appears to 
be in a state of agitation, undergoing rapid and random movements. Early in the 
nineteenth century, the biologist Robert Brown wrote a paper on this 
phenomenon [10] which received wide attention, and as a result it has been 
named for him. 

The modem era in the theory of Brownian motion began with Albert 
Einstein, who, initially unaware of the widely observed phenomenon of 
Brownian motion, was looking for a way to confirm the atomic nature of matter. 
Einstein obtained a relation between the macroscopic diffusion coefficient, D, 
and the atomic properties of matter. This relation is D = RT /NA6irrja, where R 
is the gas constant, Na — 6.02 x 10 23 mol -1 is Avogadro’s number, T is the 
temperature in kelvins, rj is the viscosity, and a is the radius of the Brownian 
particle [11, 12]. It has since been confirmed by many experiments on Brownian 
motion [13]. 
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In this section we derive the theory of Brownian motion starting from the 
Langevin equations for a Brownian particle. That is, we focus on a large particle 
(the Brownian particle) immersed in a fluid of much smaller atoms. The 
agitated motion of the large particle is much slower than that of the atoms and is 
the result of random and rapid kicks due to density fluctuations in the fluid. 
Since the time scales of the Brownian motion and the atomic motions are vastly 
different, we can separate them and focus on the behavior of the Brownian 
particle. The effect of the fluid on the Brownian particle can be reduced to that 
of a random force and a systematic friction acting on the Brownian particle. 

The Langevin theory of Brownian motion provides a paradigm theory for 
treating many-body systems in which a separation of time scales can be 
identified between some of the degrees of freedom. For this reason we consider 
it in some detail here. 


5.E.I. Langevin Equation [3, 4] 

Consider a particle in a fluid undergoing Brownian motion. For simplicity we 
will consider motion in one dimension. The results can easily be generalized to 
three dimensions. We assume that the particle is free to move in the fluid but 
that the effect of the fluid is included by adding a friction, which is proportional 
to the velocity, and a random force, £(t), due to random density fluctuations in 
the fluid. The equations of motion of the Brownian particle are 


dv(t) 

dt 

dx(t ) 
dt 


m m 


V M. 


(5.76) 

(5.77) 


where v(t) and x(t) are the velocity and position, respectively, of the particle at 
time t, m is the mass of the particle, and 7 is the friction coefficient. Equations 
(5.76) and (5.77) are the Langevin equations of motion for the Brownian 
particle. 

The random force, £(t), is a stochastic variable giving the effect of 
background noise, due to the fluid, on the Brownian particle. We will assume 
that £(r) is a Gaussian white noise process with zero mean so that {£( t ))^ = 0. 
The noise is Markovian and stationary and the average, ( )^, is an average with 
respect to the probability distribution of realizations of the stochastic variable 
£(0- We will not write the probability distribution explicitly. The assumption 
that the noise is white means that the noise is delta-correlated, 

= (5.78) 

and therefore (as we shall show in Section 5.E.2 its power spectrum contains all 
frequency components. The weighting factor, g, is a measure of the strength of 
the noise. Such a correlation function indicates that the noise is not correlated 
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from one instant to the next, and therefore it is impossible to represent a single 
realization of £(f) in terms of a continuously drawn line. The fact that the noise 
is Gaussian with zero mean (cf. Exercise 4.9 means that correlation functions 
containing an odd number of terms, £(f), are zero and that correlation functions 
containing an even number of terms, £(r), can be expressed in terms of sums of 
products of the pairwise correlation function, (£(ti)£(t 2 ))£- For example, 

(£(fl)£(*2)£(f3)£(f4))£ =(€(h)€(t2))z(€(t3)£(t4))z 

+ + <€(<.)€(*«)>{<€(» 2 )«(< 3 )> { . 

Equations (5.76) and (5.77) can be solved fairly easily. Let us assume that at 
time t = 0, the velocity and position of the Brownian particle are v(0) = vo and 
jc( 0) = Jto, respectively. Then the solution the Eqs (5.76) and (5.77) is 

v(f) = v 0 e~ hlm) ' + - ' dse-bW-’Hk) (5.79) 

m Jo 

and 

x(t)=x 0 + -(l -e- w “>')vo + - f<fe(l (5.80) 

7 7Jo 

Equations (5.79) and (5.80) give v(f) and x(t) for a single realization of £(f). 
Since £(f) is a stochastic variable, v(t) and x(t) are also stochastic variables 
whose properties are determined by £(r). The average velocity (subject to the 
condition that v(0) = vo) is (v(f)L = voe~^ m ^ and the average displacement is 
((x(t) - *o))f = " (1 - e" ( ^ m)f )v 0 . 

We can also obtain correlation functions from Eqs. (5.78), (5.79), and (5.80). 
If we make use of the fact that (vo^(/))^ = 0, then we can write 

<v(f 2 )v(/i)) { = ^e-<7/»)('»+9) 



The integral is easily done to yield 


<vfe)v(fi)) { = (vl - JG) e-W'-Jfe+M + (5.82) 

We can also obtain the variance in the displacement. If we use Eqs. (5.78) 
and (5.80) and the fact that (x 0 ^(f))^ = 0. we can write 

<W0 - *») 2 > e = £ (4 - 2 ^) " e_(7/m) ') 2 + £ ['■ - "0 - ' 

(5.83) 
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Thus, after a long time the variance goes as ((^(^ 2 ) — a:o) 2 )^ = ( g/l 2 )t 
(neglecting some constant terms). This is the same behavior that we saw for 
random walks in Chapter 4, if we choose a diffusion coefficient, D = g/ 2 ^ 2 . 

We can use a simple trick to determine the value of g for a Brownian particle 
in equilibrium with a fluid. Let us assume that the Brownian particle is in 
equilibrium with the fluid and average over all possible initial velocities, vo- We 
denote this “thermal” average by () T . By the equipartition theorem, for a 
particle in equilibrium the average kinetic energy is \k B T for each degree of 
freedom, \m{v l) T = \k B T, where k B is Boltzmann’s constant and T is the 
temperature in kelvins. If the Brownian particle is in equilbrium, its velocity 
correlation function must be stationary. Therefore, we must have vj = g/ 2 m 7 
so the first term on the right in Eq. (5.82) is removed. If we now take the 
thermal average of Eq. (5.82), we see that we must have g — 27 k B T. The 
correlation function can be written 


((v(f 2 )v(fi)) { ) T = 1 . (5.84) 

The absolute value on the time difference ensures that correlations always 
decay as the time difference increases. Information about the initial velocity of 
the Brownian particle decays exponentially. 

For the case when the Brownian particle is in equilibrium with the fluid, the 
variance of the displacement becomes 

((WO- * 0 ) 2 > f >r = [*-"0 -e- (7/m) ’)]- (5-85) 

where we have assumed that (xq) t = (vq) t = 0 and that xq and vo are 
statistically independent so that (xovo) = 0. Thus, after a long time, 
({(x(t) — xo) 2 )^) t = ( 2 k B T/j)t and the diffusion coefficient becomes D = 
k B T I'). The friction coefficient, 7, can also be determined from properties of the 
fluid and and hydrodynamics. For large spherical Brownian particles, we can 
assume that the fluid sticks to the surface. The friction coefficient is then the 
Stokes friction, 7 = 67rr]R, where 77 is the shear viscosity of the fluid and R is 
the radius of the Brownian particle (see Sect. (S10.G)). For very small 
Brownian particles, stick boundary conditions might not apply and the friction 
coefficient, 7, might be different. 


■ EXERCISE 5.5. Consider a Brownian particle of mass m which is 
attached to a harmonic spring with force constant k and is constrained to 
move in one dimension. The Langevin equations are 


dv 

dt 



m 


x + — £ 0 ) and 
m 
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where ujq = ^Jk/m. Let *o and vo be the initial position and velocity, 
respectively, of the Brownian particle and assume that it is initially in 
equilibrium with the fluid. Then by the equipartition theorem, the average 
kinetic energy is \rn{v q) t = \k B T and average vibrational potential energy is 
\uq{xq) t = \k B T. We also assume that xo and v 0 are statistically 
independent so (xovo) r = 0. (a) Show that a condition for the process to 
be stationary is that the noise strength is g = 47 k B T. (b) Compute the 
velocity correlation function, ({v(h)v(t\))^) T . 

Answer: The Langevin equations can be solved and give the following 
expression for the velocity at time t : 

v(f) = v 0 e~ rt C(t ) - ^7X 0 £ -r 'sinh(Af) + — [ dt'^(t')e~ r ^~^C(t - r'), 

A m J 0 

where C(t ) = cosh(At) — (r/A)sinh(Af), T = 7/m, and A = yjT 2 — Jq. 
If we use the fact that (jtov 0 ) r = 0 and assume that f 2 > t\, the velocity 
correlation function can be written 

((v(f 2 )v(r i)) e ) r =e~ T ^ + ^C(t 2 )C(t l )(v 2 0 ) T 

4- (A:o) 7 '^ _r(/2+ * l) sinh(At2)sinh(At I ) 

4-^ J ' dte~ r(t2+t '~ 2t) C(t 2 - t)C(h - t ). 

If we choose g = A^k B T to ensure stationarity then after some algebra we 
obtain 

« v (f2)v(fi)) e ) r =~~ e ~ r{t2 ~ h) cosh(A(r 2 -h)) -^sinh(A(t 2 — fi)) • 

A similar calculation for t\ > t 2 yields the same answer but with t\ <-> f 2 . 
Thus, 

((v(ti 4- T)v(ri))^) r = <T r l r| cosh(Ar) - ^sinh(A|r|) . 


5.E.2. The Spectral Density (Power Spectrum) 

A quantity of great interest in studying stationary stochastic processes is the 
spectral density (or power spectrum). If one has experimental data on a 
stochastic variable 'ip(t ) — that is, if one has a time series for ip(t ) — then it is 
useful to compute the power spectrum because this contains a great deal of 
information about the stochastic process, as we shall see below. 
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In practice, in dealing with data from experiments, one has a finite length of 
time series to work with. Let us assume that we have a time series of length T. 
We denote it 00; T ), where 00; T ) = 00) for — T /2^t^T/2 and 00; T) = 0 
otherwise. The lim 7^00; T) — 00). The Fourier transform of 00; T) is 

( OO 

dt^vj )^ . (5.86) 

— (X) 

Because 00; T) is real, we have 0(u>; T) = 0*(— to; T), where * denotes 
complex conjugation. 

The spectral density is defined as 

S^{u) = lim ^0*(u;;r)0(u;;r). (5.87) 

If we substitute Eq. (5.86) into (5.87), we find 


1 roc 

Sil),ip{u) = lim — 

/ —►OO I 


( OO 

— OO 




1 f C 


dr e ,w(t) lim - dt x ^{t\ + r; r)00i ; T) 
r-^ooi j_ 0o 

d T e ~ luJ ^ (ip(t x + r)00i))j, 


(5.88) 


where (00i + r)ip(ti))- t denotes the time average: 

1 f°° 

Wti+T)fKti))l = r dtuKh +r;T)^i;r). (5.89) 

r-^°o7 J_ oc 


Equation (5.89) denotes the average of the product of numbers, 
00i; T)00i 4- r; T), over each point of the time series, 00). Since we assume 
that the stochastic process is stationary, the stochastic properties of the time 
series don’t change as we shift along the time series. Therefore we expect that 
the time average, (00i -I- r)00i))j, should be equal to the statistical average, 
(■0(6 + r)00 i))^, in which we average the product, 00i + r)00i), at a given 
point, t = t\, but over an infinite number of realizations of the time series, 00), 
at time t. Therefore, for a stationary process, we expect that 

C^(t) = (0(ti + r)0(fi))^ = (00! + r)00i)) ? (5.90) 

If we now combine Eqs. (5.89) and (5.90), we obtain the following expression 
for the spectral density: 




• OO 

dre~ lu>T (00i + t)00i))^, = 

J — OO 


* OO 
J — OO 


dre “"C,;,. 


(5.91) 




Fig. 5.2. (a) The correlation function, C vv (t). (b) The spectral density, S vv (to). 
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It is interesting to compute the spectral densities for the Brownian motion 
described in Section 5.E.I. Let us consider the case when the Brownian particle 
is in thermal equilibrium with the fluid and the process is stationary. From 
Eq. (5.78), the spectral density for the white noise is 

'OO 

S u (w) = dre-^M f, + T)((h)) i = g = 2'yksT. (5.92) 

J— OO 

Thus, a white noise process contains all frequencies with equal weight. The 
correlation function, C#(t), and spectral density, %(o;), are shown in Fig. 5.1. 
The spectral density for the velocity time series is 

( °° 'IkoT'v 

_Jre-^( V ( h + r)v( (l )> { • „ ,.v " : > ( 5 - 93 ) 

Plots of the velocity correlation function and spectral density for the Brownian 
particle are given in Fig. 5.2. 


f 

j ■ EXERCISE 5.6. Compute the spectral density, S V)V (u;), for the 
j harmonically bound Brownian particle considered in Exercise 5.5. Plot the 
j velocity correlation function, C v v (t), and spectral density S VjV (cu) for the 
| case ujo > T (this corresponds to an underdamped Brownian particle). 

| 

j Answer: The velocity correlation function can be written 
I C,„(t) = «v(<! +T)v(/,)> { > 1 . = IM c - r lr| ^cos(«|T|)-ism(tf|r|)], 


where T = ^ and 6 = y/u% — T 2 = —/A. The spectral density can be written 

■°0 n j 1 

5 V)V (u;) = dr cos (urr)e -r ' T l — cos (<5| r | ) — - sin <5 1 r | ) 

Jo LI ® 

47 JcbTlJ 2 

m(ujQ — 2 6 lj + + 2 6uj + u> 2 ) 


Both C VjV (r) and 5 V)V ( u) are plotted in the accompanying graphs. 

Because the Brownian particle is underdamped, the velocity correlation 
function, C v v (r), oscillates as it is damped out exponentially. The spectral 
density, S VjV ( u), has peaks near lj = ±u>o and the peaks have an 
j approximate width, ^ . 


When the noise is not “white”, the spectral density still affords a means of 
making contact with experiment [14]. 
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SPECIAL TOPICS 

^ S5.A. Time Periodic Markov Chain 

Let us now consider the case in which the transition probability is time- 
dependent but periodic in time with period N. That is, Q n ,m{s) = 
P\\\{n, j|m, j+ 1) = Pi\i(n, s + N\m, s + N + 1). The probability vector 
after one period of the transition probability can be written 

(P(A0I = (P(0)|Q(0)Q(1) x ... x Q(N- 1) = (P(0)|U, (5.94) 

where U = Q(0)Q(l)x---xQ(^-l) is the transition probability that 
takes the system from the initial state, (P(0) j , to the state after one period, 
(P (N) | . More generally, the probability vector after l periods of the transition 
matrix is 


<P(W)| = (P(0)|U'. (5.95) 

We can expand the probability vector, (P(IN)\, in terms of left and right 
eigenvectors of the transition matrix, U. Let A a be the ath eigenvalue of U, and 
let (x«| and \ip a ) denote the left and right normalized eigenvectors of U, 
respectively, where a = l,...,Af. Thus, U|V><») = Aa|^ a ) and (x tt |U = 
(Xa | A tt . We then proceed as we did in Section 5.C.I. If U is an M x M 
matrix, then its spectral decomposition is given by 

M 

U = ^A a |V' a )(Xa|. (5.96) 

a= 1 

The probability vector after / periods of the transition matrix is given by 

M 

(P(W)| =^Ai(P(0)|^>(x a |. (5.97) 

a— 1 

The probability for the nth realization of the stochastic variable Y at time s = IN 
is given by 


M M 

P(n , IN) = EE A l a P{m, 0 )ip a (m)xa(n). 

m= 1 a=l 


(5.98) 


We will demonstrate the use of these equations in Exercise 5.7. 


■ EXERCISE 5.7. Let us consider a stochastic variable Y with three 
realizations, y(l), y( 2), and y(3). Let us assume that the transition 
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probabilities between these states are <2i,i(- s ') = 02,2 0*') — 0 3) 3 (s) = 0, 
01, 2 (j) = 02, 3 (s) = 03,1 (j) = COS 2 (27Ts/3), and 01, 3 ($) = 02,1 W = 
03, 2 (j) = sin 2 (27T5/3). If initially the system is in the state y(l), what is 
the probability to find it in the state y(2) after l periods of the transition 
matrix? 

Answer: The transition matrix has a period N = 3. In general it is given by 

/ 0 cos 2 (2f) sin 2 (2 m) \ 

Q(j) = sin 2 (^) 0 cos 2 (2 m) I . 

^cos 2 (2f) sin 2 (2 m) 0 / 

Therefore, 


cos 2 (2f) sin 2 (2 m) 


/ 0 1 0 
Q(0) — ( 0 0 1 


and Q(l) = Q(2) = 


VI 0 0/ 

and the transition matrix U becomes 


U = Q(0)Q(1)Q(2)= & A 1 


inc eigenvalues ui u aie ivi = i , 55 

9 = tan -1 (3\/3/13). 

The right eigenstates can be written 


1 

3 

_9 \ 


16 

8 

16 1 


9 

1 

3 


16 

16 

8 

1 

3 

9 

_L 

| 

8 

16 

16/ 


e~ 

1 9 , and A 3 

= — j6 e+,e , where 


ki) = 


/ — e +in/3 
\th)= ( —e ~ i7T / 3 


l^3> = 


The left eigenstates can be written 

(*i I = 6 > 3 ) > <X 2 1 = (- \ e _i7r/3 , - 5 e + ™/ 3 , |) 

and 

<»l = 

We can now compute the probability P( 2, 3/). The initial probability density 
is P(n, 0) = From Eq. (5.98), the probability P( 2, 31) is 

P(2, 31) = A'^(l) X i(2) + A'^(1 )x 2 (2) + A'^ 3 (1)x 3 (2) 
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► S5.B. Master Equation for Birth-Death Processes [3, 4] 

One of the most common applications of the master equation is the description 
of birth-death processes such as one finds in chemistry and population 
dynamics. Birth-death processes are processes in which transitions can only 
take place between nearest-neighbor states. For example, if one considers a 
population in which only one individual is produced at each birth and one 
individual dies at each death, then we have a typical birth-death process. 


^ S5.B.1. The Master Equation 

For concreteness let us consider a population of bacteria. We assume that at 
time t, there are n bacteria and we make the following assumptions: 

(1) The probability of a bacterium dying during the time t — * t + At is given 
by w„ )B _i(f)Af = d n (t)At. 

(2) The probability of a bacterium being created in time t — ► t + At is given 
by w^+i (f)Ar = b n (t)At. 

(3) The probability that no change occurs in the bacteria population in time 
t -* t 4- Atis given by [1 - Y!i= \ w mJ (t)At\ = [1 - (b m {t) + d m (t))At\. 

(4) The probability of more than one birth or death in time t — ► t 4 Ar is 
zero. 

The transition probability can be written 


t\n , t + At) =[1 - (b m (t) 4- d m (t))At\6 m , n 

+ (b m (t) 6 njm+ 1 + d m (t)6 n)m -\ At 4- 

and the master equation takes the form 

dP\ (n, t) 


(5.99) 


dt 


=b n - i {t)P\ (n ~ 1 , t) 4- d n+ i (t)P i (n + 1 , t) 
- (b n (t) 4- d n (t))P(n, t ). 


( 5 . 100 ) 


Note that we have allowed the birth and death rates, b n {t) and d n {t) respectively, 
to depend on time. In most applications of Eq. (5.100) that we will consider 
here, they will be independent of time. 

Let us consider the case in which b n and d n are independent of time. Then the 
master equation will have at least one stationary state, P„ = P(n, oc), which is 
independent of time and satisfies the condition dP^/dt = 0. (If the transition 
matrix, W, is made up of uncoupled blocks, then there may be more than one 
stationary state.) The master equation for the stationary state can be written in 
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the form 


0 = d n+l P* +] - b n P s n - 0 i n P s n - bn-xP U)- (5.101) 

Note that since d n+l P* +l - b n P s n = d n P s n - b n -\P s n _ x , the quantity J = 
b n -\P s n -\ ~ d n P s n must be independent of n. The quantity J is just the net 
probability current between pairs of sites. 

The case where no probability flows, J = 0, is equivalent to detailed balance 
since 


bn-lPLx = dnP S n . 


(5.102) 


Systems obeying detailed balance are very similar to systems in thermodynamic 
equilibrium, since no net probability current flows between microscopic states 
of those systems. For systems obeying detailed balance, we can iterate 
Eq. (5.102) to obtain an expression for P s n in terms of PJ: 


Pt = 


b n -ib n -2...b 0 

dndn—l—d\ 


(5.103) 


The value of Pq is obtained by the condition that the total probability be 
normalized to one, Y^n=o P n — 1 • 

The full master equation can only be solved exactly for some special cases. 
In the following two sections, we give some techniques for solving it exactly for 
some of the few cases where that is possible. In Section S5.B.2 we consider the 
case when the birth and death rates depend linearly on the population number. 
In Section S5.B.3 we consider the case when the birth and death rates depend 
nonlinearly on the population number. We will discuss some approximation 
schemes for solving the master equation in Section S5.D. 


► S5.B.2. Linear Birth-Death Processes 

Let us assume that the probability of a birth or death is proportional to the 
number of bacteria present and is independent of time. Then b n = (3n and 
dr i = 7 n, where (3 and 7 are constants, and the master equation takes the form 

dPx(n, t ) 

- P(n - l)Pi (n - 1 , t) + 7 (n + \)P\ (n + 1 , t) - (fin + 7 n)P(n, t ). 

(5.104) 

Equation (5.104) describes a linear birth-death process because the 
coefficients of Pi on the right-hand side of the equation depend linearly on 
n - Note that n is the number of bacteria and therefore n^0. Thus, Eq. (5.104) 
must never allow probability to flow into regions where n < 0. We see that 
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Eq. (5.104) satisfies that condition. If n — — 1, the coefficient of P\(n+ 1, t) is 
zero so flow can never occur from positive to negative values of n. Equation 
(5.104) is said to have a natural boundary at n — 0. This may not always be the 
case and one must be careful when using such master equations. 

Linear master equations depending on discrete stochastic variables are often 
most easily solved by means of a generating function, F(z, t). We will use this 
method to solve Eq. (5.104). The generating function is defined as 

00 

F(z, 0 = Y *)• (5.105) 

n=— oc 


Since we have a natural boundary at n = 0, it does not matter that the 
summation in Eq. (5.105) extends to n — — oo. Various moments of the 
population number, n, are obtained by taking derivatives of F(z, t ) with respect 
of z and then allowing z — ► 1 . For example, 


(n(t)) = lim 

z — ► l 


dF(z, t) 
dz 


oo 

Y nPi *) 

n— — oo 


(5.106) 


and 


(n 2 (t)) - ( n(t )) = lim 

Z — ► 1 


&F(z, t) 

dz 2 


OO 

Y {n 2 -n)Pi{n, t ). 

n=—oo 


(5.107) 


Higher moments may be obtained in an analogous manner. 

We can now obtain a differential equation for the generating function, 
F(z, t ). Let us multiply Eq. (5.104) by z? and sum over n. Note, for example, 
that 


OO 

Y ^ n ~ i ) p d n ~ o = 

n =— oo 


d_ 

dz 


E ^' p ' 


(n- 1, t) =z 2 


dF{z, t) 
dz 


and 


OO I oo 

Y 1 ) p i( n+ o ~~t Y z” +lp i( w + !> o = 

n=- oo ^ n=— oo 

We obtain the following differential equation for F(z, t): 


dF(z, t) 
dz 


dF 

dt 


(z-t)(0z- 


( 5 . 108 ) 


If we substitute Eq. (5.105) into Eq. (5.108) and equate like powers of z, we 
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retrieve Eq. (5.104). Equation (5.108) is a first-order linear partial differential 
equation and may be solved using the method of characteristics. 


■ Method of Characteristics [15]. Let us consider a first order linear 
differential equation of the form 


dF ^<) +g(z) m^ +h(z)F(z<t) = o, 


(i) 


dt ' °"' dz 

where g(z ) and h(z) are arbitrary functions of z. Write F(z ) t) in the form 

F(z, t) = [W ^>]$(z, r), (2) 


where $(z, t) is an unknown function of z and t. If we substitute Eq. (2) into 
Eq. (1), we find a partial differential equation for <E>(z, t ): 


«•<*■») +gW «*£ii> = o. 


dt 


dz 


( 3 ) 


We now find the characteristics for Eq. (3). These are lines in the z-t plane 
along which $(z, t) is constant. These lines satisfy the equation 
dt = dz/g(z ) or J z dz/g{z) - t = Ci, where Ci is a constant. It is easy to 
show that any function of J z dz/g(z) — t is a solution of Eq. (3). Generally 
we write the solution as 4>(z, t ) = $(<J and the solution to Eq. (1) 

takes the form 


F(z, t ) = 


( 4 ) 


If we use the method of characteristics to solve Eq. (5.108), we find 


F(z, t) = F 


. 0z - 7 / 


,(P~ 


(5.109) 


Although we know that F(z, t ) is a rather complicated function of / 3 , 7 , and t , 
we still do not know the exact form of F(z, t ). This we obtain from the initial 
conditions. Let us assume that at time t = 0 there are exactly m bacteria in the 
population. Thus, P(n, 0) = 6 M ?M and F(z, 0) = z m . From Eq. (5.109) we find 


F 


\Pz- 7 / 


= z 


(5.110) 


If we now let u = /3(z — 1 )/(/3z - 7 ), then we can write z = (7 u — / 3)/j3(u — 1 ), 
so 
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and 


/ 7 (z - l)e ^ 7)> - Pz + j \ m 
\/3(z — 1 )e^~^ t - (3z + 7/ 


(5.112) 


Equation (5.112) is exact and contains all possible information about the linear 
birth-death process. We can now obtain Pi (n, t ) and all the moments of n from 
Eq. (5.112). In order to obtain Pi(n, t) we must expand Eq. (5.112) in a Taylor 
series in z. Pi(n, t) is the coefficient of z". The first moment is 


MO) = 



= me^ 7 ^. 


(5.113) 


The variance is 


(n\t)) - {nit)} 1 = )'(i - eW-'t)') (5.114) 

If the birth rate is greater than the death rate, the population will grow 
exponentially with time. If the death rate is greater than the birth rate, the 
population will die out. Of course, Eq. (5.104) gives a vastly oversimplified 
description of the dynamics of a bacteria population. We have left out factors 
involving food supply, temperature, and so on, but it does give a good starting 
point. For an excellent discussion of the application of master equations to 
population dynamics, see Ref. 16. 

Birth-death equations can also be applied to chemical reaction kinetics. For 
example, in Problem S5.2 we consider the reaction 

A + X^A + Y, 
h 


where the number of A molecules is kept constant but the number of X 
molecules and Y molecules can vary. k\ (^2) is the probability per unit time that 
a reaction takes place if an X molecule (Y molecule) collides with an A 
molecule. The reaction in the forward direction requires n A A molecules, nx + 1 
X molecules, and n y — 1 Y molecules to produce a system containing n A A 
molecules, nx X molecules, and ny Y molecules. Similarly, the backward 
reaction requires A molecules, nx — IX molecules, and n Y + 1 Y molecules 
to produce a system containing ha A molecules, nx X molecules, and ny Y 
molecules. The transition rate for the forward reaction depends on the number 
of binary collisions, n^(nx + 1), between A molecules and X molecules. 
Similarly, the transition rate for the backward reaction depends on the product, 
>u(wy + l). Thus, the transition rate for the forward reaction i s 
wi = k\nA{n x + 1), and for the backward reaction it is m >2 = k 2 n A (n Y + 1 ). 
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If we assume that the total number of X molecules and Y molecules is 
constant and equal to N, then the master equation can be written 

9Pl ft l ~ = k i( N ~ n + l)Pi (n - 1, t) + ki(n+ l)Pi(n + 1, t) ^ 
- ( kin + ki{N - n))P(n, t). 

where n is the number of X molecules and N - nis the number of Y molecules. 
For simplicity we have absorbed n\ into k\ and ki. We leave it as a homework 
problem to solve this master equation. 


^ S5.B.3. Nonlinear Birth-Death Processes [3, 4, 1 7] 

Some of the simplest examples of nonlinear birth-death processes come from 
population dynamics and chemical reaction kinetics. A nonlinear birth-death 
process is one for which one or both of the transition rates, b n (t) and d n (t), 
depend nonlinearly on n. The master equation for nonlinear birth-death 
processes usually cannot be solved exactly, except for the long-time state. 
However, for binary nonlinear processes they sometimes can be solved exactly 
because the partial differential equation for the generating function might be 
solvable in terms of orthogonal polynomials. 

An example of an exactly soluble nonlinear system is the binary chemical 
reaction [18] 


2X-^B. 

The transition rate for this reaction is proportional both to the reaction rate, k, 
and to the number of different ways, \ tix(nx — 1), to form pairs of X molecules 
if there are n x X molecules in the system. If the system has nx + 2 X molecules 
and n B — IB molecules before the reaction, then it will have nx X molecules 
and Me B molecules after the reaction. The transition rate is 
w = | ( nx + 2 )(nx + 1). The master equation can be written 

— ~ = \ ( n + 2)(n + 1 )P(n + 2, t)-^n{n- l)P(n, t ). (5.116) 

This equation has a natural boundary at n = 0 since probability cannot flow 
into P(— 1, t) from above. The equation for the generating function takes the 
form 
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The generating function, F(z, t), in terms of Gegenbauer polynomials can be 
written 


F(z, t ) = Y^Ke~ {k,1)n{n ' X]t C- l l 2 {z). (5.118) 

n= 0 

The coefficient A n depends on initial conditions. The first few Gegenbauer 
polynomials are Cq 1 /2 (z) = 1, Cj" 1/2 (z) = -z, 1/2 (z) = - (1 - z 2 ), and 

C 3 / 2 (z) =\z( 1 - z 2 ). We shall not attempt to compute A n here. Solutions 
can be found in Ref. 17. 

It is important to note that most nonlinear master equations cannot be solved 
exactly. It is then often necessary to resort to approximation schemes or 
numerical methods for solutions. 


► S5.C. The Fokker-Planck Equation [3, 4, 19] 

The Fokker-Planck equation is the equation governing the time evolution of the 
probability density for the Brownian particle. It is a second-order differential 
equation and is exact for the case when the noise acting on the Brownian 
particle is Gaussian white noise. The derivation of the Fokker-Planck equation 
is a two-step process. We first derive the equation of motion for the probability 
density p(x, v, t) to find the Brownian particle in the interval x — ► x + dx and 
v — > v -p dv at time t for one realization of the random force £(t). We then 
obtain an equation for P(x, v, t) = (p(x, v, t))^ the average of p(x, v, t) over 
many realizations of the random force £(t). The probability density P(x, v, t) is 
the microscopically observed probability density for the Brownian particle. Its 
dynamical evolution is governed by the Fokker-Planck equation. 


► S5.C.1. Probability Flow in Phase Space 

Let us obtain the probability to find the Brownian particle in the interval 
x — ► x + dx and v — ► v -f dv at time t. We will consider the space of coordinates, 
X = (jc, v) (jc and v the displacement and velocity, respectively, of the Brownian 
particle), where — oo < x < oo and — oo < v < oo. The Brownian particle is 
located in the infinitesimal area, dxdv, with probability p(x,v,t)dxdv. We 
may view the probability as a fluid whose density at point (jc, v) is given by 
p(jc,v,t). The speed of the fluid at point (jc, v) is given by X = (x, v). Since 
the Brownian particle must lie somewhere in this space, we have the 
condition 



dvp(x,v,t ) = 1 . 


(5.119) 
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Fig. 5.3. A finite area in Brownian particle phase space. 

Let us now consider a fixed finite area, Ao, in this space (cf. Fig. 5.3). The 
probability to find the Brownian particle in this area at time t is P{Aq ) = 
J A dxdvp(x , v, t ). Since the Brownian particle cannot be destroyed, and change 
in the probability contained in Ao must be due to a flow of probability through 
the sides of Aq. Thus 


-l 


* 

dxdvp{x , v, t ) 

Ja 0 


() p(x,v,t)X ■ dS 0 , 

JLo 


(5.120) 


where dSo denotes a differential surface element along the edge of area A 0 , pX 
is the probability current through the edge, and Lo is the line around the edge of 
area element A 0 . We can now use Gauss’s theorem to change the surface 
integral into an area integral, § Lq p(x, v, t)X • dSo — J Aq dxdv'Vx • (Xp(x, v, t)), 
where Vx denotes the gradient, V x = (d/dx, d/dv). We find 



dxdvVx • (Xp(x,v,t)). 

JAq 


(5.121) 


Since the area A 0 , is fixed, we can take the time derivative inside the integral. 
Since the area Ao, is arbitrary, we can equate integrands of the two integrals in 
Eq. (5.121). Then we find that (dp(t)/dt) = -Vx • (Xp(t)), or 


d P(*) n ^PW) 3(vp(0) 

~W - " Vx ■ {xp{,)) ~ — Fx FT~ 


(5.122) 


where we have let pit) = p(x, v, t ). 


► S5.C.2. Probability Flow for Brownian Particle 

In order to write Eq. (5.122) explicitly for a Brownian particle, we first must 
know the Langevin equation governing the evolution of the Brownian particle. 




268 


STOCHASTIC DYNAMICS AND BROWNIAN MOTION 


Let us assume that Brownian particle moves in the presence of a potential, V(x). 
The Langevin equations are then 


dv(t) 

dt 


--v(f) +-F(x) + ~£(t) 
m m m 


and 


dx(t) 

dt 


= v(0, 


(5.123) 


where the force, F(x) equals — ( dV(x)/dx ). If we substitute these equations 
into Eq. (5.122), we find 


dp(t) = d(vp(t)) i d(vp(t)) 1 F() dp(*) 

dt dx m dv m U dv m ?U dv (5.124) 

= -Lq p(t) -Li{t)p{t), 

where the differential operators Lq and L\ are defined as 


d 7 7 d 1 . d 

Lq = v- -v— + -/ r (jc) — 

dx m m dv m dv 


and 


u =-&)£-• 


(5.125) 


Since £(t) is a stochastic variable, the time evolution of p(x,v,t) will be 
different for each realization of £(t). However, when we observe an actual 
Brownian particle we are observing the average effect of the random force on it. 
Therefore, we introduce an observable probability, P(x,v,t)dxdv, to find the 
Brownian particle in the interval x — > x + dx and v — ► v + dv. We define this 
observable probability to be 

P(x, v, t) = {p(x, v, t)) c (5.126) 


We now must find the equation of motion of P(x, v, t ). 

Since the random force, £(t), has zero mean and is a Gaussian white noise, 
the derivation of P(x , v, t) is straightforward and very instructive. It only takes a 
bit of algebra. We first introduce a new probability density, o{t), such that 


p(t) = e L Ot o(t ). (5.127) 

Using Eqs. (5.124), (5.126), and (5.127), it is easy to show that o(t) obeys the 
equation of motion: 


dcr(t) 

dt 


-V{t)o{t), 


(5.128) 


where V(t) = e + ^ ot Li(t)e~^ ot . Equation (5.128) has the formal solution 
cr(t) = exp j^— j dt'V(t') j er( 0 ). 


(5.129) 
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Let us now expand the exponential in Eq. (5.129) in a power series. Using 
the identity e? = / n -)» we obtain 


^(0 




a(0). 


(5.130) 


We now can take the average, ()^, of Eq. (5.130). Because the noise, £(f), has 
zero mean and is Gaussian, only even values of n will remain (cf. Exercise 4.9). 
Thus, we find 


wo> f = 




2n' 


cr(0). 


(5.131) 


We can use some results from Exercise 4.9 to simplify Eq. (5.131). The 
average, dt'V(t')) 2n )^, will decompose into (2n\/n\2 n ) identical terms, each 
containing a product of n pairwise averages, (J Q f dtiV(ti) Hq dtjV (tj)) Thus, Eq. 
(5.131) takes the form 


= 


E 

M=0 


1 /I 


n\ \2 


dti 


*i(Hh)Hh))A k(o). 


(5.132) 


We can now sum this series to obtain 


(*('))£ = exp k 


dti 


d ti (V(ti)V(h)) t 


^( 0 ). 


(5.133) 


Let us compute the integral in Eq. (5.133), 

\\dtiU ( n h) n^ 


8 

2m 2 

8 

2m 2 


dt 2 


dt\6(t 2 - ti)e +Lotl ~e~ Lo{h ~ tl) ~e~ Loti 

0 C7V O V 

d 2 

,+Lot\ p-Lot] 

dv 2 


(5.134) 


If we substitute Eq. (5.134) into Eq. (5.133) and take the derivative of Eq. 
(5.133) with respect to time t, we find the following equation of motion for 

< 5135 ) 

With this result, we can obtain the equation of motion of P(x , v, t) =(p(x, v, t))^. 
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Let us note that ( p(t ))^ = U(t)(cr(t))^ where U(t) = e and take the 
derivative of ( p(t ))^ with respect to time t. We then obtain 






= -i.<pw) t + -Lo(m + 


(5.136) 


If we combine Eqs. (5.125), (5.126), and (5.136), the equation for the 
observable probability density, P(x,v,t), becomes 


dP_ _ dP_ d_ 
dt V dx + dv 


— v — —F(x) )P 
m m 


+ 


8 


d 2 P 


2m 2 dv 2 ’ 


(5.137) 


where P = P(x,v,t). Equation (5.134) is the Fokker-Planck equation for the 
probability P(x,v,t)dxdv to find the Brownian particle in the interval 
x — * x + dx and v — * v -f dv at time t. 

It is important to note that the Fokker-Planck equation conserves probability. 
We can write it in the form of a continuity equation 

dP 

— =-V-J, (5.138) 

where V = x(d/dx) + \{d/dv) is a gradient operator in the (jc, v) phase space 
and J is the probability current or flux, 

J = juP-v(lvP-2p{x)p + J®?\, (5.139) 

\m m 2m l dv J 

in the (x, v) phase space. By the same arguments used in Eqs. (5.120) and 
(5.121), we see that any change in the probability contained in a given area of 
the (x, v) phase space must be due to flow of probability through the sides of the 
area, and therefore the probability is a conserved quantity. It cannot be created 
or destroyed locally. 

In this section we have derived the Fokker-Planck equation for a Brownian 
particle which is free to move in one spatial dimension. The Fokker-Planck can 
be generalized easily to three spatial dimensions. However, when the force F(x) 
couples the degrees of freedom, little is known about the details of its 
dynamical evolution. Below we consider Brownian motion in the limit of very 
large friction. For this case, detailed balance holds, the force can be expressed 
in terms of a potential, and we can begin to understand some of the complex 
phenomena governing the dynamics of the Fokker-Planck equation. 


^ S5.C.3. The Strong Friction Limit 

Let us now consider a Brownian particle moving in one dimension in a potential 
well, V(x), and assume that the friction coefficient, 7 , is very large so that the 
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velocity of the Brownian particle relaxes to its stationary state very rapidly. 
Then we can neglect time variations in the velocity and in the Langevin 
equations [Eq. (5.123)], we assume that ( dv/dt ) « 0. The Langevin equations 
then reduce to 


^■ = -F(x)+-i(‘), (5-140) 

at 7 7 

where F(x) = ~(dV(x)/dx). We now can use the method of Section S5.C.2 to 
find the probability P(x,t)dx to find the Brownian particle in the interval 
x —> x + dx at time t. The probability density, P(x, t ), is defined as the average, 
P{ jc, t) — (p(x, t ))^, where the equation of motion for the density, p(x, t ), is 
given by 


dp{t) _ d(xp) _ 1 d(F(x)p) 1 dp 

dt dx 7 dx 7 dx 

= -Lop(t) - Li(t)p(t). 


The differential operators Lq and L\ are defined as 


j dx 7 ox 




If we now substitute into Eqs. (5.136) and (5.137), we obtain 

dP{x,t) _ 1 <9 fdV . g dP(x , t) \ dJ 

dt ydx\dx K,) 2 7 dx ) dx 5 


(5.141) 


(5.142) 


(5.143) 


where J = — (l/j)(dV/dx)P + (g/2j 2 )(dP/dx) is the probability current. 
Equation (5.143) is now a Fokker-Planck equation for the probability density 
P( x, t ) to find the Brownian particle in the interval x -+ x + dx at time t. 
Because Eq. (5.143) has the form of a continuity equation, the probability is 
conserved. 


^ S5.C.4. Solution of Fokker-Planck Equations with One Variable 

For the case of a “free” Brownian particle, one for which V(x) =0, the 
Fokker-Planck equation (5.143) reduces to the diffusion equation 

dP(x,t) _ g d 2 P(x,t) _ d 2 P{x,t) 
dt li 2 dx 2 dx 2 


(5.144) 
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As we showed in Section 4.E, this has a solution 


P(x, t ) 



(5.145) 


Note that (1/D) = (l^ 2 /g) = (7 /k p T). Thus for large friction coefficient, 7 , the 
spatial relaxation is very slow. 

For the case when V{x) ^ 0, we can obtain a spectral decomposition of the 
probability density P(x,t). Let us first introduce a rescaled time r = tj 7 and 
write the Fokker-Planck equation (5.143) as 


dP(x , r) 
dr 


d 2 V dVdP g d 2 P 
dx 2 + dx dx 27 dx 2 


- L fp P(x , r). 


(5.146) 


The operator, L lp = (d 2 V /dx 2 ) -I- (dV /dx)(d/dx) + (g/2^)(d 2 /dx 2 ), is a non- 
self-adjoint operator because of its dependence on the first-order partial 
derivative. However, it is possible to rewrite the Fokker-Planck equation in 
terms of a self-adjoint operator via a simple transformation. Then the solutions 
become more intuitive. 

Let us write the probability in the form 


P(x,r) = e lV ^ x ^ g ^(x, r), (5.147) 

where ^{x,t) is a function to be determined. If we now substitute into Eq. 
(5.146) we obtain the following equation for \P(x, r): 


0 tt(x, r) 
dr 



2 _ 

2 g 



tf(*,r) + 


g d 2 ^ 

27 dx 2 


-H fp V(x, t). 


(5.148) 


The operator, H FP = -{\{d 2 V / dx 2 ) - ( 7/2 g)(dV/dx) 2 ) - (g/2'j)(d 2 /dx 2 ), is 
a self-adjoint operator and we can use the many well established techniques for 
dealing with such operators. We will let (f> n {x) and A„ denote the nth eigenvector 
and eigenvalue, respectively, of H FP so that H FP (j> n (x) = X n 4>n{x)- The 
eigenvectors are complete and can be made orthonormal so that 


dx(f>n! (x)(f) n {x) — 6 n ' ,n ■ 


(5.149) 


Furthermore, the eigenvalues are real and must have zero or positive values in 
order that the probability remains finite. We can expand ^(jc, t) in terms of the 
eigenvectors and eigenvalues of H FP : 


(jc, r) 


00 


E 

n= 0 


a„e Kr <p„(x). 


(5.150) 
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It is interesting to note that Hp P has at least one zero eigenvalue, which we 
denote Ao = 0, and a corresponding eigenvector, (f>o(x), which satisfies the 
equation 



2 g 



(f>o{x) + 


8 d 2 (f> p(x) 
27 dx 2 


= 0 . 


(5.151) 


Equation (5.150) has the solution 

Mx) = Ce~' v M' s , (5.152) 

where C is a normalization constant. This is just the transformation used in Eq. 
(5.146). Therefore we can now combine Eqs. (5.147), (5.150), and (5.152) and 
write the probability as 


P(x, t) = 4>l(x) + ^>2 a n e XnT (f>o{x)(l) n (x). (5.153) 

n= 1 


In this form, it is clear that the probability is conserved due to the 
orthonormality of the eigenstates. If we integrate over Eq. (5.153), we obtain 



*00 

dx(j> o(x) 2 = 1. 

J- 00 


(5.154) 


The coefficients, a n , can be determined from the initial conditions. Let us 
assume that we are given P(x , 0). Then we write 


P{x,0) = 4>l{x) + ^>2a n (f>o(x)(f) n (x). (5.155) 

n= 1 


If we now divide through by (f>o(x), multiply by (f>n 0 {x), and integrate over x, we 
obtain 


a n 0 



Ki x ) 

M x ) 


P( X ,0). 


(5.156) 


After a long time, the probability approaches the stationary state: 

P(x,o o) = <f> o(x). 


(5.157) 


There are several examples of Fokker-Planck equations with one variable 
which can be solved analytically. We will consider one of them in Exercise (5.8) 
and leave the others as homework problems. 
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This method can also be extended to Fokker-Planck equations with two or 
more spatial degrees of freedom when a transformation analogous to Eq. (5.146) 
can be found which allows us to write the Fokker-Planck equation in terms of a 
self-adjoint operator. For such cases, it is possible that the dynamics governed 
by the self-adjoint operator can undergo a transition to chaos. Examples of such 
cases have been studied in Refs. 22-24. 


■ EXERCISE 5.8. Consider the “short-time” relaxation of a free 
Brownian particle. The Langevin equation for the velocity is m(dv/dt ) — 
“7 v +•£(/). (a) Find the Fokker-Planck equation for the probability 
P(v, t) dv to find the Brownian particle with velocity v — > v + dv at time t. 
(b) Solve the Fokker-Planck equation, assuming that at time t = 0 the 
velocity is v = vo- 

Answer: 

(a) To obtain the Fokker-Planck equation, we will follow the method of 
Section S5.C. The equation for the probability p(v, t) for a specific 
realization of the random force £(t) is 

dp 

dt 


dvp = 7 dvp 1 dp 
dv m dv m dv 


(i) 


From Eq. (1) we see that Lo = 4- v{d/dv)). We can now 

plug into Eq. (5.136) to obtain a Fokker-Planck equation for the 
probability density P(v, t). We find 


dP 

dt 


7 d(vP) g d 2 P 
m dv 2 m 2 dv 2 


( 2 ) 


(b) To solve the Fokker-Planck equation, we follow the method of 
Section S5.C.3. Make a transformation, P(v,t ) = e~ mivl / 2g ^(v, t). If 
we plug this into Eq. (2), we obtain the following equation for ^(v, t ), 


mdSb (\ 1 2 „< 9 2 \ t 

7 dt \2 4 A dv 2 J 


(3) 


where A = (g/2mj). The operator// = ~ — (l/4A)v 2 4 -A(d 2 /dv 2 ) is 
self-adjoint and has eigenfunctions (f>„(v) (n = 0, 1, 2, ... , oo) which 
can be expressed in terms of Hermite polynomials [20]. The nth 
normalized eigenfunction of H is 


= —L=H n (-^=) 
\/ 2 n n\\f2 , nA \y2A) 


(4) 


where H n (y) is the nth-order Hermite polynomial and can be written 
H„iy) = (— 1 ) n e yl (d n /dy n )e~ y2 . The operator H satisfies the eigen- 
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value equation H(f> n (v ) = — n0„(v), so the nth eigenvalue is A„ = — n. 
The eigenfunctions satisfy the orthonormality condition 


• 00 

dv(j> n '{y)(f> n {v) = 

— 00 


(5) 


If we redefine the time to be r = ( 7 /m)t, then we obtain the following 
spectral decomposition of \&r(v, t). 


^(v,*) = nT <f>n( v )- 

n = 0 

The probability P(v, r) is 

00 

P(v,t) = ^2a n e~ nr (j)o(v)(l) n (v). 


(6) 


(7) 


*=o 


The initial probability distribution is P(v, 0) = <5(v — vo). This gives 
a n = </>n(vo)/ 0 o(vo) and we obtain 


p (v,r) = ^ e nT M v o)M v )M v )/M v o) 


n = 0 

1 


,-v*/2A 


n=0 

We now can make use of an identity 




Via) n \V2A)‘ 


(9) 


\/l - z 


1 ^+y»-2 ^) >| =e .^ 


1 - r 


V 


£ 

«=0 


2 n n\ 


H n (x)H n (y ) 
( 10 ) 


(see Ref. 21, page 786). Using this identity, the probability can be 
written 


1 


p(v ’ T) = vm - «-*> exp ( iSr^y ) ■ (11) 

Thus, the probability density has the form of a Gaussian. The average 
velocity decays as (v(t)) = y 0 e~ T , and t he velocity distribution has a 
standard deviation, cr v = y/A(l — e~ 2r ). In the limit of “long time” 
the probability density takes the form 


P(v,t) 


:exp 


/— m 2 v 2 \ 

\ZykVr) 


( 12 ) 


yfhrjkVfJm 1 

Thus, for large friction coefficient, 7 , the relaxation time for the 
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velocity distribution goes as 7 1 while the relaxation time for the 
spatial distribution goes as 7. 


^ S5.D. Approximations to the Master Equation [3, 4] 

The master equation, under some conditions, may be reduced to a Fokker- 
Planck equation. For example, if the step size shrinks and the transition rate 
grows as a small parameter decreases to zero, then we can use the Kramers- 
Moyal expansion to obtain a Fokker-Planck equation from the master equation. 
We have already used this fact in Chapter 4 when we obtained the continuum 
limit for discrete random walks. 

Let us consider a random walk in which the step size is given by A. The 
master equation can be written 

d P|( ” A,t) = £ [i> 1 (mA,/) Wmi „(A)-P l (nA,/) w „, m (A)], (5.158) 

m =— oo 

where Pi(nA, t) is the probability to find the walker at point x = nA at time t. 
Let us choose the transition rate to be 

wv,n( A) = — 4- 8 n ',n- 1)- (5.159) 

Then the master equation becomes 

~ [P, ((« + 1)A, 0 + P, ((« - 1) A, t) - 2 P, (nA, r)]. (5. 160) 

We now let x = nA in Eq. (5.159) and let A — > 0. To determine what becomes 
of the left-hand side in this limit, we expand it in a Taylor series in the small 
parameter, A keeping x = nA. Then 


dPx ^ — = 1™ 2^2 [ p i (^ + A, 0 + Pi (x - A, /) - 2Pi (x, r)] 


= lim ~ 
A— >0 A 2 


OP A 


P,(x,t) + (^)^A + - 


1 (d 2 P{\ 


dx 2 ) A=0 


A 2 + 




d 2 P i 
dx 2 


(5.161) 


Thus, in the limit A — ► 0 the random walk is described by a simple Fokker- 
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Planck equation. Higher-order derivatives in the Taylor series do not contribute 
because their coefficients go to zero. Lower-order terms in the Taylor series 
cancel. We see that the condition to obtain a Fokker-Planck equation is that the 
step size, A, decreases and the transition rate, w m „(A), increases as 1/A 2 . This 
is a simple example of a Kramers-Moyal-type expansion [25, 26]. 

It is useful to consider a more general example. Let us write the master 
equation for the continuous stochastic variable x : 

‘00 

dx'[P\(x / , t)w(x'\x) - Pi(x, t)w(x\xf)], (5.162) 

— OO 

where vv(x'|a:) is the transition rate. Let y = xf — x denote the change in 
stochastic variable x at a transition, and introduce the notation r(jc,y) = 
w(x|x 4- y) so that t(x — y,y) = w{x — y|y). In Eq. (5.162), make the change of 
variables x! = x + y, and in the first term under the integral let y — > —y. Then 
the master equation can be rewritten 

• OO 

dy[P\{x-y,t)T(x-y,y) - Pi(x)r(x,y)}. (5.163) 

-00 

We can now expand P\(x — y,t)r(x — y,y) in a Taylor series in y. This, of 
course, only makes sense if the function r(x,y) is sharply peaked about y — 0. 
Equation (5.163) takes the form 




dP\ (s, t) 

dt 




( -y) n d n 

n\ dx n 


{Pi{x, t)r(x,y)) 



00 




n= 1 


(-y) n d n 

n\ dx n 


(Pi{x, t)r(x,y)). 


P i(x, t)r(x,y ) j 

(5.164) 


Thus, 


dPi(x,t) A (- \) n d n 


dt 


=E— 

n=l 


where a n (x) is the nth moment, 


<*„(*) = j 


dyy n r(x,y) = 


dyy n w(x|x4-y). 


(5.165) 


(5.166) 


Equation (5.165) is the Kramers-Moyal expansion of the master equation. It 
only has meaning for those special forms of w(x|x + y) = r(x,y) for which the 
infinite series of higher-order derivatives truncates. We shall now give an 
example of such a case. 
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Let us choose w(x\x + y) = r(x , y) to be of the form 


w(4r + y) = T(x,y) = 



(y-A(x) A 2 ) 2 \ 

a 2 y 


Then 


and 


( 00 

dyyw(x\x + y) = A(x), 

— 00 


a 2 = 


=i 


dyy 2 w(x|.x 4- y) = \ + A 2 A 2 (x). 


(5.167) 


(5.168) 


(5.169) 


Higher-order odd moments are identically zero, and higher-order even moments 
are proportional to powers of A. Thus, for the choice of w(x|jc -f y) given in Eq. 
(5.167), as A —» 0, the master equation reduces to 


dPi(x,t) _ .dPifat) 1 d 2 Pi(x,t) 
dt [X) Ox 4 dx 2 


(5.170) 


For this process the step size decreases because the width of the Gaussian 
shrinks, and the transition rate increases as A — ► 0. If this occurs in the 
particular manner shown in Eq. (5.167), then the Kramers-Moyal expansion has 
meaning. 

For the case when the step size in a master equation cannot be made 
arbitrarily small, as is the case for chemical reactions or population dynamics, 
the Kramers-Moyal expansion of the master equation may not give a good 
approximation. Then one must use some other approximation scheme or solve 
the equation numerically. An alternative approximation scheme has been 
introduced by van Kampen. He has shown that it is still possible to approximate 
the master equation by a Fokker-Planck equation if the system has a large 
parameter, such as a volume or total particle number, and if the transition rates 
depend on that large parameter in a certain way. A full discussion of this 
method may be found in van Kampen ’s book [3]. 
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PROBLEMS 

Problem 5.1. Urn A initially has one white and one red marble, and urn B initially has 
one white and three red marbles. The marbles are repeatedly interchanged. In each step 
of the process one marble is selected from each urn at random and the two marbles 
selected are interchanged. Let the stochastic variable Y denote “configuration of the 
urns.” Three configurations are possible: (1) Urn A — 2 white balls, Urn B — 4 red balls; 



280 


STOCHASTIC DYNAMICS AND BROWNIAN MOTION 


(2) Urn A — one white and one red ball, Urn B — one white and three red balls; (3) Urn 
A — two red balls, Urn B — two white and two red balls. We shall denote these three 
realizations as y(l), y(2), and y(3), respectively, (a) Compute the transition matrix, Q, 
and the conditional probability matrix, P^ok). (b) Compute the probability vector, 
(P(j)|, at time s, given the initial condition stated above. What is the probability that 
there are 2 red marbles in urn A after 2 steps? After many steps? (c) Assume that the 
realization, y(n), equals n 2 . Compute the first moment, (y(s)), and the autocorrelation 
function, (y(O)y(j)), for the same initial conditions as in part (b). 

Problem 5.2. Three boys, A, B, and C, stand in a circle and play catch (B stands to the 
right of A). Before throwing the ball, each boy flips a coin to decide whether to throw to 
the boy on his right or left. If “heads” comes up, the boy throws to his right. If “trials” 
comes up, he throws to his left. The coin of boy A is “fair” (50% heads and 50% tails), 
the coin of boy B has heads on both side, and the coin of boy C is weighted (75% heads 
and 25% tails), (a) Compute the transition matrix, its eigenvalues, and its left and right 
eigenvectors, (b) If the ball is thrown at regular intervals, approximately what fraction of 
time does each boy have the ball (assuming they throw the ball many times)? (c) If boy 
A has the ball to begin with, what is the chance he will have it after two throws? What is 
the chance he will have it after s throws? 

Problem 5.3. A trained mouse lives in the house shown in the Fig. 5.4. A bell rings at 
regular intervals (short compared to the mouses lifetime). Each time it rings, the mouse 
changes rooms. When he changes rooms, he is equally likely to pass through any of the 
doors of the room he is in. Let the stochastic variable Y denote “mouse in a particular 
room.” There are three realizations of Y “mouse in room A,” “mouse in room B,” and 
“mouse in room C,” which we denote as y(l), y(2), and y(3), respectively, (a) Compute 
the transition matrix, Q, and the conditional probability matrix, P(jok)- (b) Compute the 
probability vector, (P(j)|, at time s, given that the mouse starts in room C. 
Approximately what fraction of his life does he spend in each room? (c) Assume that 
the realization, y(n), equals n. Compute the first moment, (y(j)), and the autocorrelation 
function, (y(O)y(j)), for the same initial conditions as in part (b). 

Problem 5.4. The doors of the mouse’s house in Fig. 5.4 are fixed so that they 
periodically get larger and smaller. This causes the mouse’s transition probability 
between rooms to become time periodic. Let the stochastic variable Y have the same 
meaning as in Problem 5.3. The transition matrix is now given by 
Qi,i(s) = Q 2 ,i(s) = £> 3 , 3 (s) = o, QiM = cos 2 (7ts/2), QiM = sin 2 (7Tj/2), 

QiM = \ + \ sin 2 (7rj/2), Q 2 ,3(s) = l + ^cos 2 (ttj/2), 0 3j i(j) = ±cos 2 (7r.y/2), and 

03,2 (s) = j + 1 sin 2 (7rj/2). (a) If initially the mouse is in room A, what is the 



Fig. 5.4. Mouse’s house. 
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probability to find it in room A after 2s room changes? In room B? (b) If initially the 
mouse is in room B, what is the probability to find it in room A after 2s room changes? 
In room B? 

Problem 5.5. Consider a discrete random walk on a one-dimensional periodic lattice 
with 2N + 1 lattice sites (label the sites from —N to N ). Assume that the walker is 
equally likely to move one lattice site to the left or right at each step. Treat this problem 
as a Markov chain, (a) Compute the transition matrix, Q, and the conditional probability 
matrix, P (jo|s). (b) Compute the probability P\ ( n , s) at time s, given the walker starts at 
site n = 0. (c) If the lattice has five lattice sites (N = 2), compute the probability to find 
the walker on each site after s — 2 steps and after s = oo steps. Assume that the walker 
starts at site n — 0. 


Problem 5.6. At time t , a radioactive sample contains n identical undecayed nuclei, 
each with a probability per unit time, A, of decaying. The probability of a decay during 
the time t — ♦ t + At is XnAt. Assume that at time t = 0 there are no undecayed nuclei 
present, (a) Write down and solve the master equation for this process [find P\ (n, f)]. (b) 
Compute the mean number of undecayed nuclei and the variance as a function of time, 
(c) What is the half-life of this decay process? 

Problem 5.7. Consider a random walk on the lattice shown in Fig. 5.5. The transition 
rates are wi )2 — vvi )3 = \ , w 2) i = vv 2 , 3 = w 2 ,4 — 5 , w 3 i - w 3>2 = w 3) 4 = | , W4 )2 = 
W43 = 5, and VV14 = W41 = 0. (a) Write the transition matrix, W, and show that this 
system obeys detailed balance, (b) Compute the symmetric matrix, V, and find its 
eigenvalues and eigenvectors, (c) Write P\ (n, i) for the case P\ (n, 0) = 6 n 1 . What is 

*( 4 , 0 ? 

Problem 5.8. Consider a random walk on the lattice shown in Fig. 5.6. The site, P, 
absorbs the walker. The transition rates are wq )2 = wj 3 =5, w 2j i = w 23 = w 2 /> = 5, 
w 3) i = w 3 2 = w 3 p = and vv/y = vv/> 2 — (a) Write the transition matrix, M, and 

compute its eigenvalues and and left and right eigenvectors, (b) If the walker starts at 
site n = 1 at time t — 0, compute the mean first passage time. 

Problem 5.9. Let us consider on RL electric circuit with resistance, R, and inductance, 
L, connected in series. Even though no average electromotive force (EMF) exists across 
the resistor, because of the discrete character of the electrons in the circuit and their 
random motion, a fluctuating EMF, £(t), exists whose strength is determined by the 
temperature, T. This, in turn, induces a fluctuating current, I(t), in the circuit. The 
Langevin equation of motion for the current is 


j d M 

dt 


+ */(;) = £(t), 
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If the EMF is delta-correlated, (£(* 2 )£(*i))£ = gS{ti — *i) and (£(0)^ = 0, compute, g, 
and the current correlation function, <{/(f 2 )/(^i ))^)r* Assume that the average magnetic 
energy in the inductor is \L{Iq) t = \k B T and (Iq) t = 0, where 7(0) = 7 0 . 

Problem 5.10. Due to the random motion and discrete nature of electrons, and LRC 
series circuit experiences a random electromotive from (EMF), £(f). This, in turn, 
induces a random varying charge, Q(t), on the capacitor plates and a random current, 
I(t ) = ( dQ(t)/dt ), through the resistor and inductor. The random charge, Q(t), satisfies 
the Langevin equation 


d 2 Q(t) 
dt 2 


+ R 


dQ(t ) 
dt 



= £(')• 


Assume that the EMF is delta-correlated, {£(^)£(*i))£ = g8{ti ~ t\), and (£(t))^ = 0. 
Assume that the circuit is at temperature T and that the average magnetic energy in the 
inductor and average electric energy in the capacitor satisfy the equipartition theorem, 
\L(ll) T = \k B T and ^ ( Ql) t = \k B T, where Q{ 0 ) = Qq and 7 ( 0 ) = 7 0 . Assume that 
(C?o)r — (h)r ~ (Qok/r = 0- ( a ) Compute the current correlation function, 

((I(t 2 )I(h))^) T . (b) Compute the variance of the charge distribution, 

(((GW - Qo)\)t ■ 

Problem 5.11. Consider a Brownian particle of mass m moving in one dimension in the 
presence of a constant force / 0 (such as a gravitational or electric field) in a fluid with 
force constant 7 and in the presence of a delta-correlated random force £(f) such that 
(£(^)£(^t)}^ = gS{t 2 — fi) and (£(*))£ = 0 . Assume that the velocity and displacement 
of the Brownian particle at time t = 0 are vo and *o, respectively, (a) Compute the 
velocity correlation function (v(r 2 )v(ri))^. (b) Compute the variance ((x(t) - x 0 ) 2 } ( . 

Problem S5.1. Consider a linear birth-death process which includes the possibility of a 
change in population due to immigration in addition to the change that occurs due to the 
birth and death of individuals in the population. Assume that at time t the population 
has n members. Let aAt be the probability that one individual enters the society due 
to immigration in time t — > t + At. Let f3nAt{^nAt) be the probability of a birth (death) 
in time t — > t + At. Assume that at time t — 0 the population has n—m members, 
(a) Write the master equation and compute the generating function, F(z , t) = 
00 (n, f), for this process, (b) Compute the first moment (n(t)), and the 

variance, ( n 2 (t )) — ( n(t )) 2 , for this process. 


Problem S5.2. Consider the chemical reaction 


A + X^A + Y, 

*2 

where molecule A is a catalyst whose concentration is maintained constant. Assume that 
the total number of molecules X and Y is constant and equal to N. k\ (^ 2 ) is t* 16 
probability per unit time that molecule X (Y) interacts with a molecule A to produce a 
molecule Y (X). (a) Find the equation of motion for the probability P\ (n, t) to find n X 
molecules in the system at time t, and find the generating function, F{z : t) = 
z?P\ (n, t). (b) Find the average number of X molecules in the system at time 
t , and find the variance in the distribution of X molecules. In the limit, A — > o o, 
how does the variance for the particle distribution compare with that of a Gaussian 
distribution? 
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Fig. 5.7. Diffusion through a small hole. 


Problem S5.3. Consider a box (A) of volume Q, connected to another box (B) of infinite 
volume via a small hole (cf. Fig. 5.7). Assume that the probability that a particle moves 
from box A to box B in time At is (n/Q)At and that the probability that a particle 
moves from box B to box A in time At is pAt, where p is the density (assumed constant) 
of particles in box B. (a) Write down the master equation for the probability Pi ( n , t) 
to find n particles in box A at time t, and compute the generating function, 
F(z, t) = X^i-oo z”Pi ( n i 0- Assume that there are m particles in box A at time t = 0. 
(b) Compute Pi(n,f) for die same initial conditions as part (a), (c) Find the average 
number of particles in box A and find the variance in particle number as a function of 
time. 

Problem S5.4. Consider the chemical reaction in Problem S5.2 and let N = 3, ki —2, 
and = 1. (a) Write the transition matrix, W and compute its eigenvalues and left and 
right eigenvectors, (b) If initially there are zero X-molecules in the system, what is the 
probability of finding three X-molecules at time t. What is the probability of finding 
three X-molecules at time t = oo? 

Problem S5.5. Consider the following chemical reaction, 

A + M-^X + M and 2X-^E + D, 

where molecules A, M, E, and D are obtained from large reservoirs and can be assumed 
constant, (a) Find the probability to have n X molecules in the system after a very long 
time, t -+ oo. (b) Find the average number of X molecules after a very long time. [Some 
hints: Use the boundary conditions F(l) = 1 and F(— 1) = 0 for the generating 
function, F(z). The generating function can be found in terms of modified Bessel 
functions. The transformation F(z) = y/sG(s), where s = ( 1 +z)/2, might be helpful.] 

Problem S5.6. The motion of an harmonically bound Brownian particle moving in one 
dimension is governed by the Langevin equations, 

m ^j t - = -7 v(f) - mu%x{t) + £(0 and = v(t), 

where v(f) and x(t) are the velocity and displacement of the particle at time t, m is the 
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mass, 7 is the friction coefficient, u> 0 is the natural frequency of the harmonic oscillator, 
and £(t) is a delta-correlated random force. If the particle at time t = 0 is in equilibrium 
with the fluid, compute the variance, {((x{t) - xo) 2 )^) T . [Note that for this case, 
(£(*2)£(fi))£ = 47^5 77 > (?2 —t\), and by the equipartition theorem, ( xq) t = ksT/mul 
and (vo) r = foT/m. Also assume (v 0 ) r = (xo) T = ( xqvo) t — 0.] 

Problem S5.7. Consider a Brownian rotor with moment of inertia, /, constrained to 
rotate through angle, 9, about the z axis. The Langevin equations of motion for the rotor 
are I(du/dt) = -To; -f- £(r) and (d9/dt) = u, where u is the angular velocity of the 
rotor, F is the friction coefficient, and £(r) is a Gaussian white noise torque. The torque 
is delta-correlated, (£(0£(*))e = - t ), where G is the noise strength, (a) For the 

case of large friction coefficient, F, write the Fokker-Planck equation for the probability 
density, P(9, t ), to find the Brownian rotor in the interval 9 — > 9 + d6 at time, t. (b) Solve 
the Fokker-Planck equation assuming that at time t = 0 the rotor is at 9 = 9 0 . (c) 
Compute the probability current at time t. 

Problem S5.8. A Brownian particle of mass m moves in one dimension in the presence 
of a harmonic potential V{x) — \kx 1 , where k is the force constant. The Langevin 
equations are given by m[dv(t)/dt\ — -7 v(t) — dV(x)/dx + £(r) and dx(t)/dt = v(t), 
where 7 is the friction coefficient and £(t) is a Gaussian white noise force. The noise is 
delta-correlated, (C(^)C(O)^ = g^i*' ~ where g is the noise strength, (a) Write the 
Fokker-Planck equation for the probability density, P(x, t), in the limit of large friction 
coefficient, (b) Solve the Fokker-Planck equation and write the solution, P(x,t), for 
arbitrary times. Assume that at time t = 0, the Brownian particle is at x — x 0 . (c) Write 
an approximate expression for P(x, t ) for very long times. 

Problem S5.9. Consider a biased random walk along the x axis with step size A and 
transition rate 


u’(nAln'A) 


A*(nA) (3(nA)\ 
v A A 2 ) 


^n',n+\ “f" 


( a(nA) (3(nA)\ 
V A + A 2 ) 




where — oo^n^oo and — oo^n'^oo. Use the Kramers-Moyal expansion to obtain a 
Fokker-Planck equation for this random walk in the limit A — ► 0 with nA = x. 



THE FOUNDATIONS OF 
STATISTICAL MECHANICS 


6.A. INTRODUCTION 

In Chapter 5 we studied the time evolution of probability distributions in the 
Markov approximation, where the dynamics of the process was determined in 
terms of a single transition probability. The transition probability itself is 
usually determined phenomenologically; and thus the equations we derived, the 
master equation and the Fokker-Planck equation, may be considered as 
phenomenological equations. However, they exhibit exactly the type of 
behavior we need to describe the observed irreversible decay of systems to a 
unique equilibrium state. In this chapter we will set up the machinery for a 
microscopic probabilistic description of matter for both classical and quantum 
mechanical systems. That is, we shall lay the foundations for statistical 
mechanics and we will learn how thermodynamics and irreversible processes 
are thought to arise from the reversible laws of dynamics. 

We want to describe the behavior of systems with a large number of degrees 
of freedom, such as N interacting particles in a box or N interacting objects on a 
lattice. The motion of such objects is governed by Newton’s laws or, 
equivalently, by Hamiltonian dynamics. In three dimensions, such a system has 
3 N degrees of freedom (if we neglect internal degrees of freedom) and 
classically is specified by 6N independent position and momentum coordinates 
whose motion is uniquely determined from Hamiltonian dynamics. If we set up 
a 6iV-dimensional phase space, whose 6 N coordinates consist of the 3N 
momentum and 3,/V-position variables of the particles, then the state of the 
system is given by a single point in the phase space, which moves according to 
Hamiltonian dynamics as the state of the system changes. If we are given a real 
N-particle system, we never know exactly what its state is. We only know with a 
certain probability that it is one of the points in the phase space. Thus, the state 
point can be regarded as a stochastic variable and we can assign a probability 
distribution to the points in phase space in accordance with our knowledge of 
the state of the system. We then can view the phase space as a probability fluid 
which flows according to Hamiltonian dynamics. In this way, we obtain a 
connection between the mechanical description and a probabilistic description 
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of the system. The problem of finding an equation of motion for the probability 
density reduces to a problem in fluid dynamics. In this chapter, we shall also lay 
the foundations for equilibrium statistical mechanics. For classical dynamical 
systems which are ergodic, an equilibrium probability distribution can be 
constructed which gives excellent agreement with experiment. 

When we deal with quantum systems the phase space variables no longer 
commute and it is often useful to use representations other than the coordinate 
representation to describe the state of the system. Thus, we introduce the idea of 
a probability density operator (a positive definite Hermitian operator), which 
can be used to find the probability distribution in any desired representation. We 
then can use the Schrodinger equation to find the equation of motion for the 
probability density operator. 

The JV-body probability density for a classical system contains more 
information about the system than we need. In practice, the main use of the 
probability density is to find expectation values or correlation functions for 
various observables, since those are what we measure experimentally and what 
we deal with in thermodynamics. The observables we deal with in physics are 
generally one- or two-body operators, and to find their expectation values we 
only need reduced one- or two-body probability densities and not the full N- 
body probability density. In the special topics section we shall find that the 
equations of motion for the reduced probability densities form a hierarchy of 
equations called the BBGKY hierarchy (after Bom, Bogoliubov, Green, 
Kirkwood, and Yvon, who discovered it), which makes them impossible to 
solve without some approximation which terminates the hierarchy. This, in fact, 
is a general feature of all reduced descriptions. 

In quantum systems, we cannot introduce probability densities which specify 
both the position and momentum of the particle, because these quantities do not 
commute. However, we can introduce quantities which are formally analogous, 
namely, the Wigner functions. The Wigner functions are not probability 
densities, because they can become negative. However, they can be used to 
obtain expectation values in a manner formally analogous to that of classical 
systems, and the reduced Wigner functions form a hierarchy which in the 
classical limit reduces to the classical BBGKY hierarchy. 

Finally, in the last sections of the special topics section, we shall describe 
conditions under which systems which are governed by the reversible laws of 
Newtonian dynamics display irreversible behavior. 


6.B. THE CLASSICAL PROBABILITY DENSITY [1-3] 

If we view the flow of points in the phase space of a classical 7V-body 
Hamiltonian system as a fluid whose dynamics is governed by Hamilton’s 
equations, then we can derive the equation of motion for the probability density 
of the classical system in phase space. This equation of motion is called the 
Liouville equation. Because Hamilton’s equations preserve volume in phase 



THE CLASSICAL PROBABILITY DENSITY 


287 


space, the probability fluid flow is incompressible. It is also nondissipative, 
unlike the probability flow governed by the Fokker-Planck equation. 

Let us consider a closed classical system with 37V degrees of freedom (for 
example, TV particles in a three-dimensional box). The state of such a system is 
completely specified in terms of a set of 67V independent real variables (p^, q^) 
(p^ and q^ denote the set of vectors p N = (Pi,P 2 , • • • , PaO an d q^ = 
(qi,q 2 , . . . , q^), respectively; p ; and q ; are the momentum and position of 
the Ith particle. If the state vector = X^(p /V , q^) is known at one time, then 
it is completely determined for any other time from Newton’s laws. 

If we know the Hamiltonian, H(X N ,t), for the system, then the time 
evolution of the quantities p/ and q ; (/ = 1 , . . . , TV) is given by Hamilton’s 
equations, 


• _ d?i _ dH N 
** 1 ~ dt dq t 


(6.1) 


and 


• _ d( h _ dH ‘ W 
^ 1 ~ dt dp[ 


(6.2) 


If the Hamiltonian does not depend explicitly on the time, then it is a global 
constant of the motion and we can write 


H n {X n ) = E , (6.3) 

where the constant, E, is the total energy of the system. In this case the system 
is called conservative. 

Let us now associate to the system a 67V-dimensional phase space, F. The 
state vector X N (p N , q N ) then specifies a point in the phase space. As the system 
evolves in time and its state changes, the system point X N traces out a trajectory 
in T-space (cf. Fig. 6.1). Since the subsequent motion of a classical system is 
uniquely determined from the initial conditions, it follows that no two 
trajectories in phase space can cross. If they could, one could not uniquely 
determine the subsequent motion of the trajectory. 

When we deal with real physical systems, we can never specify exactly the 
state of the system. There will always be some uncertainty in the initial 
conditions. Therefore, it is useful to consider X N as a stochastic variable and to 
introduce a probability density p(X N , t ) on the phase space, where p(X N , t)dX N 
is the probability that the state point, X N , lies in the volume element 
X n — ► X n + dX N at time t. (Here dX N = dq x x • ■ • x ^q^Pi x • • • x dp N .) In 
so doing we introduce a picture of phase space filled with a continuum (or fluid) 
of state points. If the fluid were composed of discrete points, then each point 
would be assigned a probability in accordance with our initial knowledge of the 
system and would carry this probability for all time (probability is conserved). 
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Fig. 6.1. Movement of a system point, X jV (t), in a 6N-dimensional phase space 
(?2 > t\). We show only four of the 6N coodinate directions. 


The change in our knowledge of the state of the system with time is determined 
by the way in which the fluid flows. Since for real systems the state points form 
a continuum, we must introduce a probability density, p(X N , t ), on the phase 
space. 

Because state points must always lie somewhere in the phase space, we have 
the normalization condition 


j p(X N ,t)dX N = 1 (6.4) 

where the integration is taken over the entire phase space. If we want the 
probability of finding the state point in a small finite region R of T space at time 
t, then we simply integrate the probability density over that region. If we let 
P(R ) denote the probability of finding the system in region R, then 

P(R) = f p(X N ,t)dX N . (6.5) 

J/? 

If at some time there is only a small uncertainty in the state of the system, the 
probability density will be sharply peaked in the region where the state is 
known to be located, and zero elsewhere. As time passes, the probability density 
may remain sharply peaked (although the peaked region can move through 
phase space) and we do not lose any knowledge about the state of the system. 
On the other hand, it might spread and become rather uniformly distributed, in 
which case all knowledge of the state of the system becomes lost. 
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Probability behaves like a fluid in phase space. We can therefore use 
arguments from fluid mechanics to obtain the equation of motion for the 
probability density (cf. Section S10.A). We will let X N = (q N , p^) denote the 
velocity of a state point, and we will consider a small volume element, Vo* at a 
fixed point in phase space. Since probability is conserved, the total decrease in 
the amount of probability in Vo per unit time is entirely due to the flow of 
probability through the surface of Vo. Thus, 

fp(V o) = || p(X N ,t)dX N = - £ p(X N ,t)X N ■ dS N , (6.6) 

where So denotes the surface of volume element Vo, and dS N is a differential 
area element normal to So. If we use Gauss’s theorem and change the surface 
integral to a volume integral, we find 

If ptX'VjdX" = - [ V x » • (p(X N ,t)X N )dX N , (6.7) 

0 t JV o JVo 

where V x * denotes the gradient with respect to phase space variables 
V X " = ((d/d<Ji)> ( d/dq 2 ), . . . , {d/dq N )\ (d/dpj), . . . , (d/dp N )). We can take 
the derivative inside the integral because V 0 is fixed in space. If we equate 
arguments of the two integrals in Eq. (6.7), we obtain 

|p(XV) + V x » ■ (p(X'V)X") = 0. (6.8) 

Equation (6.8) is the balance equation for the probability density in the 6 N- 
dimensional phase space. 

We can use Hamilton’s equations to show that the probability behaves like an 
incompressible fluid. A volume element in phase space changes in time 
according to the equation 

dX? = f N (t,to)dXl (6.9) 

where ,/ N (t,to), the Jacobian of the transformation, is the determinant of a 
6N x 6/V-dimensional matrix which we write symbolically as 



( 6 . 11 ) 
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if we remember that the product of the determinant of two matrices is equal to 
the determinant of the products. 

Let us now assume that the system evolves for a short time interval 
At = t — t 0 . Then the coordinates of a state point can be written 

Pf' = pj + Pl> + 0(A/) 2 (6.12) 

and 

i!' , = qZ + < A ' + °( At ) 2 - (6.13) 


[Again Eqs. (6.12) and (6.13) have been written symbolically to denote the set 
of 6 N equations for the components of X^.] If we combine Eqs. (6.10), (6.12), 
and (6.13), we find 


/ (t,to) = det 


dp, 0 




At 


dp * 

1+^1 At 

d < 


= 1 + 








At + 0(A/) : 


(6.14) 


(cf. Appendix A). However, from Hamilton’s equations [cf. Eqs. (6.1) and (6.2)] 
we obtain 


d< dpyj 


= 0 


and, therefore, 


f N (t,to) = 1 +0(Ar) 2 . 


(6.15) 


(6.16) 


From Eq. (6.11) we can write 

S N (t, 0) = Jf N (t,to)/ N (to,0) = /"(lo,0)(l +0(Af) 2 ), (6.17) 

and the time derivative of the Jacobian becomes 


d/ N ,. / A '(fo + Ar,0)-/ fl '(r 0 ,0) 

— l lm 

dt At — >o At 


(6.18) 


Thus, for a system whose dynamics is determined by Hamilton’s equations, the 
Jacobian does not change in time and 

/"( f,0) = /' v (0,0) = l. 


(6.19) 



THE CLASSICAL PROBABILITY DENSITY 


291 


Equation (6.19) is extremely important for several reasons. First, it tells us that 
volume elements in phase space do not change in size during the flow (although 
they can change in shape): 

dX? = dX%. (6.20) 

Second, it tells us that the probability behaves like an incompressible fluid since 

V x »X* = 0 (6.21) 

[cf. Eq. (6.15)]. If we combine Eqs. (6.8) and (6.21) the equation of motion for 
the probability density takes the form 

= _x* . v x »p(xV). (6.22) 

Note that Eq. (6.22) gives the time rate of change of p(X N , t) at a fixed point in 
phase space. If we want the time rate of change as seen by an observer moving 
with the probability fluid, then we must find the equation for the total time 
derivative of p(X N ,t). The total or convective time derivative is defined as 

(6 - 23) 


[cf. Appendix A, Eq. (A.5)] and, therefore, from Eq. (6.22) we obtain 


dp(X N ,t) 

dt 


(6.24) 


Thus, the probability density remains constant in the neighborhood of a point 
moving with the probability fluid. 

If we use Hamilton’s equations, we can write Eq. (6.22) in the form 

(6.25) 

where the differential operator, H N , is just the Poisson bracket 



dH N d \ 

dqj dp j) 


(6.26) 


(we put a hat on jf N to indicate that it is differential operator). Equation (6.25) 
is often written in the form 


. dp(x N ,t) 

dt 


= L N p(x N ,t ), 


(6.27) 
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where L N = —iJf N . Equation (6.27) is called the Liouville equation and the 
differential operator, L N , is called the Liouville operator. The Liouville operator 
is a Hermitian differential operator. 

If we know the probability density, p(X N , 0), at time, t = 0, then we may 
solve Eq. (6.27) to find the probability density, p(X N ) t), at time t. The formal 
solution is 


p(X N ,t) = e- iLn, p(X N ,0). (6.28) 

A probability density, p s (X N ), which remains constant in time must satisfy the 
condition 


L N ps(X N ) = 0 (6.29) 

and is called a stationary solution of the Liouville equation. The Liouville 
equation is particularly simple to solve explicitly if the mechanical system is 
integrable and one can make a canonical transformation from phase variables, 
(Pi » • • • » Pat, fli » • • • , <Lv)> to action-angle variables (J b . . . , J*, 0i , . . . , 0 N ) [4, 
5]. We show how this is done in Exercise 6.1. 

The Liouville equation, (6.27), is the equation of motion for the probability 
density of a classical dynamical system. Its properties differ in an important 
respect from the properties of the Fokker-Planck equation in Chapter 5. The 
Liouville operator is Hermitian, whereas the Fokker-Planck operator is not. 
Thus, the solution of the Liouville equation will oscillate and not decay to a 
unique equilibrium state. Furthermore, if we reverse the time in Eq. (6.28), we 
do not change the equation of motion for the probability density since the 
Liouville operator changes sign under time reversal. This is different from the 
Fokker-Planck equation, which changes into a different equation under time 
reversal. Thus, Eq. (6.27) does not admit an irreversible decay of the system to a 
unique equilibrium state and thus cannot describe the decay to equilibrium that 
we observe so commonly in nature. And yet, if we believe that the dynamics ot 
systems is governed by Newton’s laws, it is all we have. The problem of 
obtaining irreversible decay from the Liouville equation is one of the central 
problems of statistical physics and is one that we shall say more about in 
subsequent sections. 


■ EXERCISE 6.1. Consider a particle which bounces elastically and 
vertically off the floor under the influence of gravity (assume no friction 
acts). At time t = 0, the particle is located at z = 0 and has upward 
momentum, p = po. It rises to a maximum height, z = h. Solve the Liouville 
equation to find the probability density, p(p,z,t), at time t. 

Answer: The Hamiltonian for the particle can be written H = 
{p 2 /2m) + V(z) = E, where V(z) = mgz for z>0 and V(z) = oo for 
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z < 0. The turning point (the point where the momentum goes to zero) of the 
orbit is at z = h, the energy is E = mgh, and the initial momentum is 
po = m.\J2gh. Hamilton’s equations are p = — ( dH/dz ) = —mg and 
z = (p/m). Hamiltonian’s equations may be solved to give 


z(t) — \/2ght - 1 1 2 and p(t) = my/lgh - mgt 


for 0 < t < T, 


( 1 ) 


where T = 2y/ (2 h/g) is the period of the motion. Both z(t) and p(t) are 
periodic functions of time with period T. 

We can also describe this system in terms of action-angle variables. The 
action is 


1 




= — f * dzpE^z = J-J- E 3 ' 2 

n Jo 


37rg V m 


so the energy, as a function of action, is 

'9ir 2 g 2 m' 1/3 


H = 


8 


J 2/ 3 = E. 


(2) 


( 3 ) 


From Hamilton’s equations we have J = ( dH/d9 ) = 0, so the action is a 
constant of the motion, and the angle evolves according to the equation 


A dH _ fm\ V 2 

6 = m =' “ {J) = ^(ie) - 


* ghn \'> 3 

it ) =7r< 


( 4 ) 


We see again that the period of the motion is T = (2n /uS) = 2y/(2 h/g). If 
the angle variable at time t = 0 is 9 = 0, then at time f it is 6 = a it. We can 
now use Eqs. (1), (3), and (4) and the fact that t = 0/ui to write the canonical 
transformation from phase space variables (p,z) to phase space variables 
(J, 9). We find 


7r \ m J 


( 5 ) 


and p(J , 8) = — 


0) 


for 0 < 9 < 2n. 


Because the transformation (5) is canonical, the Jacobian of the 
transformation 


\dj do J 


( 6 ) 


is equal to one as can be easily shown. Thus, dpdz = dJd9 and 
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<5(p - Po)8(z - zq) = &{J - Jo)6(0 - 9 0 ), where p 0 = p(J 0 , 0 O ) and zq = 
z{Jo,9o). We can write the Liouville equation in terms of the canonical 
variables (p,z), 


dp .dp .dp 
~dt +P dp + Z dz~°' 


(?) 


or in terms of canonical variables (7, 9), 


9p ' + e dp '-o 
-dF + e ~d8-°’ 


( 8 ) 


where p(p,z,t) = p'(J,9,t ) are the probability densities. The initial condi- 
tions are given by p(p, z,0) = 6{p - p 0 )S(z) and p'(J, 9 , 0) = 6(J - Jo)6(0), 
where J Q = (2m/37r) y/2gh?. Because the probability density, p'(J,9,t), is a 
periodic function of 9, we can expand it in a Fourier series 

t oo 

= p n (j,t) ei " e . (9) 

' n =— oo 

I If we plug Eq. (9) into the Liouville equation, (8), and use the fact that 
t Y^L-oo eln ° — 27r<5(0) and (1/27 t) J Q 27r d9e md = 6 n p, then we find that the 
| Fourier coefficient evolves in time as 

Pn(J,‘) = P.(J, ( 10 ) 

where 9 = u ;(/). Therefore 


1 00 

P \j,9,t) = - (ii) 

n —— oc 

If we now make use of the initial conditions, we obtain 

p'{J, 9, t) = 6(J - Jo )6(0 - u(J)t) ( 1 2) 

and 


p(p,z,t) = 6{p - p(J 0 ,ut))6(z- z(Jo,vt)), (13) 

where p(J 0 ,ut) and z{Jq, cot) are given by Eq. (5). 


We shall often be interested in obtaining the expectation value of phase 
functions, O n (X n ). The expectation value of O^(X^) at time t is given by 

(0(0) = f dXi • • • j dx N o N (x N )p(x N , 0 

= [ dX, • • • f dX N O rJ (X r '')e-‘ L "'p(X N , 0). 


(6.30) 
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We have written the expectation value in the “Schrodinger” picture. That is, we 
have allowed the state function, ^(X^, t), to evolve in time and we have kept the 
phase function, O^(X^), fixed in time. We could equally well allow the phase 
function to vary in time and keep the state function fixed. We note that the 
Liouville operator, L N , contains derivatives with respect to and q N . If we 
expand the exponential in a power series and integrate by parts, there will be a 
change of sign for each partial derivative and we find 

(0(0) = [dx, • • • [ dX N 0 N (X N , t)p(X N , 0) 

r f . (6-31) 

= UX, • ■ • dX N p(X N , 0 )e +L "'0 N (X N , 0). 

(we assumed that p(X N \ 0) — * 0 for large X N ). Thus, we obtain the classical 
version of the “Heisenberg” picture. We see that phase functions and state 
functions evolve according to different laws. The equation of motion of a phase 
function is given by 

» — fi 0 - (632) 

where Eq. (6.32) gives the evolution of O^X^, t) at a fixed point in phase 
space. 

i ■ EXERCISE 6 . 2 . A system of N particles has a Hamiltonian 
H — Ym=\ P i/ 2m + V(|q t . — q ; |). The phase function which gives 

j the particle density at position R in configuration space is 
; «(q^,R) = Y/!i = i ~ R). Write the equation of motion of ^(q^jR). 


Answer: The equation of motion of n(q Ar ,R) is given by Eq. (6.32). Since 
J n( q N : R) does not depend on momentum, Eq. (6.32) reduces to 


d n . d . 

(1) 

If we now replace the differentiation with respect to qj by a differentiation 
with respect to R, we obtain 

S=-v»- 

i=i 

(2) 

(since q f = (p ,/m)) or 


g=-V R .J(p",q";R), 

(3) 


296 


THE FOUNDATIONS OF STATISTICAL MECHANICS 


where J(p A ',q A ';R) = Ef =1 (p f / is the particle current phase 

function. Equation (3) is a balance equation which reflects the conservation 
of particle number on the microscopic level. 


It is interesting to note that the probability density, p(X N ,t), is often 
interpreted in terms of an “ensemble” of systems. This was the view originally 
taken by W. Gibbs. Let us consider an ensemble of 77 identical systems (77 very 
large). If we look at each system at a given time, it will be represented by a 
point in the 6/V-dimensional phase space. The distribution of points representing 
our ensemble of systems will be proportional to p(X N , t ). That is, the density of 
system points in phase space will be given by r)p(X N , t ). 

6.C. ERGODIC THEORY AND THE FOUNDATIONS OF 
STATISTICAL MECHANICS [6-13] 

The subject of ergodic theory was primarily the domain of mathematicians until 
the advent of modem computers. However, in recent years it has become an 
even more important subject of research because of its importance in such 
diverse fields as celestial mechanics (stability of the solar system) and 
chemistry (stability of isolated excited molecules) and because it asks questions 
which lie at the very foundations of statistical mechanics. 

As we shall see, the flow of probability in phase space is of a very special 
type. There are absolutely no diffusion processes present. Historically, two 
types of probability flow have been important in understanding the behaviour of 
phase space, namely, ergodic flow and mixing flow. For systems with ergodic 
flow, we obtain a unique stationary probability density (a constant on the energy 
surface) which characterizes systems with a fixed energy at equilibrium. 
However, a system with ergodic flow cannot necessarily reach this equilibrium 
state if it does not start out there. For decay to equilibrium, we must have at 
least the additional property of mixing. Mixing systems are ergodic (the 
converse is not always true, however) and can exhibit random behavior. In 
addition, reduced distribution functions can be defined which decay to an 
equilibrium state. We give examples of mixing flow in the special topics 
Section (S6.D). 

Ergodic and mixing behavior for real systems is difficult to establish in 
general. It has been done only for a few model systems. However, there is a 
large class of conservative systems, the anharmonic oscillators, which are of great 
importance in mechanics, chemistry, and the theory of solids. These systems are 
neither ergodic nor mixing but exhibit behavior reminiscent of both in local 
regions of their phase space. They have been studied extensively with computers 
in recent years and give great insight into the behavior of flows in phase 
space and the possible mechanism behind the irreversibility we observe in 
nature. We briefly discuss such systems in the special topics in Section (S6.E). 
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Let us now define ergodic flow. Consider a Hamiltonian system with 3N 
degrees of freedom with Hamiltonian H(p N ,q N ) — E. If we relabel the 
momentum coordinates so p\ — p x \, p 2 = p y<x , p 3 = p Z)U P\=p x p,..., 
Pw — Pz,n (with similar relabeling for the position coordinates), then 
Hamilton’s equations can be written 


dqi 

{dH/dpi) 


dqiN 


(dH/dp 3N ) 
dq 3 N 


(dH/dq 3N ) 


= dt. 


dpi 

(dH/dqi) 


(6.33) 


Equation (6.33) provides us with 6N — 1 equations between phase space 
coordinates which, when solved, give us 6N — 1 constants, or integrals, of the 
motion, 


fiipii • • • ,P3N,qU‘ ■ • i qw) = C,-, (6.34) 

where i = 1,2, . . . ,6/V — 1 and C, is a constant. However, these integrals of the 
motion can be divided into two kinds: isolating and nonisolating. Isolating 
integrals define a whole surface in the phase space and are important in ergodic 
theory, while nonisolating integrals do not define a surface and are unimportant 
[6, 14]. One of the main problems of ergodic theory is to determine how many 
isolating integrals a given system has. An example of an isolating integral is the 
total energy, H(p N ,q N ) = E. For N particles in a box, it is the only isolating 
integral (at least for hard spheres). 

Let us consider a system for which the only isolating integral of the motion 
is the total energy and assume that the system has total energy, E. Then 
trajectories in T space (the 6A-dimensional phase space) which have 
energy, E, will be restricted to the energy surface, S E . The energy surface, 
S E , is a (6N — 1) -dimensional “surface” in phase space which exists 
because of the global integral of the motion, H(pi , . . . ,p 3N , q x , . . . , q 3N ) = E. 
The flow of state points on the energy surface is defined to be ergodic 
if almost all points, X(pi, . . . ,p 3 N,qu • • • ,q 3 N), on the surface move in 
such a way that they pass through every small finite neighborhood, R E , on 
the energy surface. Or, in other words, each point samples small neighbor- 
hoods over the entire surface during the course of its motion (a given 
point, X(p\, . . . ,p 3N ,q\, . . . ,q 3N ) cannot pass through every point on the 
surface, because a line which cannot intersect itself cannot fill a surface of 
two or more dimensions). Note that not all points need sample the surface, 
only “almost all.” We can exclude a set of measure zero from this 
requirement. 

A criterion for determining if a system is ergodic was established by 
Birkhoff [15] and is called the ergodic theorem. Let us consider an integrable 
phase function f(X N ) of the state point X N . We may define a phase average of 
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the function f(X N ) on the energy surface by the equation 

(f)s = j s m N )dS E = y^y £ 6(H N (X N ) - E]f(X N )dX N , (6.35) 

where dS E is an area element of the energy surface which is invariant (does not 
change size) during the evolution of the system and J2{E) is the area of the 
energy surface and is defined as 

£(E) = dS E = [ 6(H n (X n ) - E)dX N (6.36) 

Se Jr 

(we are using the notation of Section 6.B). We may define a time average of the 
function f(X N ) by the equation 

t t to+T 

</> r = lim - f(X N (t))dt (6.37) 

T ^°° 1 J/o 

for all trajectories for which the time average exists. Birkhoff showed that the 
time average in Eq. (6.37) exists for all integrable phase functions of physical 
interest (that is, for smooth functions). 

It terms of averages, the ergodic theorem may be stated as follows: A system 
is ergodic if for all phase functions, f(X N ): (i) the time average, (f) T , exists for 
almost all X N (all but a set of measure zero), and (ii) when it exists it is equal to 
the phase average, (f) T = (f) s . 

To find the form of the invariant area element, dS E , let us first write an 
expression for the volume of phase space, f2(Zs), with energy less than E — that 
is, the region of phase space for which 0 < H N (X N ) < E. We shall assume that 
the phase space can be divided into layers, each with different energy, and that 
the layers can be arranged in the order of increasing energy. (This is possible for 
all systems that we will consider.) The volume, Q(E), can then be written 

f 1(E) = f dX N = f dA H dn H , (6.38) 

J0<H n (X n ) <E J0 <H n (X n ) <e 

where dAn is an area element on a surface of constant energy and dnu is normal 
to that surface. Since VxH N is a vector perpendicular to the surface 
H n (X n ) ^constant, we can write dH N = \VxH N \dnu and the volume becomes 

Q(E) = J dH N Y^(H N ), (6.39) 

where 

16 * 
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is a function of H N and is an invariant measure of the surface area for a given 
value of H N . If we take the derivative of Q(Zs), we find 


Sl \VxH n \h=e 


(6.41) 


The area, ^2{E), is called the structure function. By the same argument, if we 
wish to take the average value of a function f(X N ) over the surface, we can 
write 

/ /*\ If dA-E 1 d f r/vN\ i-*tN 




Thus, the differential 


I Vx#V =£ Z(E)dE Jo < H n (X n ) < E 


f(X N )dX * 


dSs = 




(6.42) 


(6.43) 


is the invariant surface area element. 


■ EXERCISE 6.3. Compute the structure function for a gas of N 
noninteracting particles in a box of volume V. Assume that the system has a 
total energy E. 

Answer: The Hamiltonian for the gas is 

3N n 2 

£ t = E - « 

i=l 

The volume of phase space with energy less than E is 

O(E) = j dq l • ■ • j dq N \dp l --\dp N for pj 4 |-p^<2mE. 

( 2 ) 

This can be written Q(E) = V N Q P , where 

= JrfPi ’ • • | dp N Q(R 2 - p? Ptf). (3) 

is the volume enclosed in momentum space and R 2 = 2mE. The volume in 
momentum space, Vt p , has the form Q p = A 3N R 3N . Let us find A 3N . This can 
be done by a trick. First do the integral 


j d P\, 

J — OC 


dp3N,z exp[-(p? ., + • • ■+p\ N ,z)} 


dpe p 2 ] =7t 3N / 1 . 
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Next note that (< d.Cl p /dE ) = J dp x • • • J d p N 8(R 2 — Pj 


pj) so that 


dR^e -” 2 

dR 


dpi r 


3NA^n 


dp3N,z ex p[— (Pi )X + • • • +P 2 3N , z )} 


dRR^-'e-* 2 =^NA 3 n t(^n), 


(5) 


where r(x) is the gamma function. If we equate Eq. (4) to Eq. (5), we find 

27j3N/2 

a 3n - 37 vr ( 3 7 V / 2 ) ■ 

Thus, the volume of the region of phase space with energy less than E is 

2V N 7T 2N/2 R 3N _ V N (2m mE) 3N/2 
' 3NT(3N/2) " r(3W/2+l) ’ ^ ' 

The structure function, ]T(E), equals (dQ(E)/dE). 


If a system is ergodic, the fraction of time that its state, X N (p N , q^), spends 
in a given region R E of the energy surface will be equal to the fraction of the 
surface S E occupied by R E . Let us consider a function 4>(R E ) such that 
4>(R e ) = 1 when X N is in R E and (f)(R E ) = 0 otherwise. Then it is easy to see 
that, for an ergodic system, 


lim 

T — >oo 


t R e 

T 


E(B) ’ 


(6.44) 


where tr e is the time the trajectory spends in R E and ^2 (Re) is the area 
occupied by R E . 

A system can exhibit ergodic flow on the energy surface only if there are 
no other isolating integrals of the motion which prevent trajectories from 
moving freely on the energy surface. If no other isolating integrals exist, the 
system is said to be metrically transitive (trajectories move freely on the energy 
surface). If a system is ergodic, it will spend equal times in equal areas of the 
energy surface. If we perform measurements to decide where on the surface 
the system point is, we should find that result. We can also ask for the 
probability of finding the system in a given region R E of the energy surface. 
Since we have nothing to distinguish one region from another, the best 
choice we can make is to assume that the probability P(R E ) of finding 
the system in R E is equal to the fraction of the energy surface occupied by 
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Re- Thus, 


eW' (6 ' 45) 

From Eq. (6.45) it is a simple matter to write down a normalized probability 
density for the energy surface, namely, 


p(X",S £ ) = ^. (6.46) 

Equation (6.46) is called the fundamental distribution law by Khintchine and 
called the microcanonical ensemble by Gibbs. Since it is a function only of the 
energy, it is a stationary state of the Liouville equation (6.27). It says that all 
states on the energy surface are equally probable. Equation (6.46) forms the 
foundation upon which all of equilibrium and most of nonequilibrium statistical 
mechanics are built. Its importance cannot be overemphasized. In Exercise 6.2, 
we give a simple example of ergodic flow. 

In this section, we have discussed ergodic theory for classical systems. 
However, it is also possible to formulate analogous definitions for quantum 
systems. In fact, the criterion is rather sample. A quantum system is ergodic if 
and only if the system has a nondegenerate energy spectrum [16]. This means, 
of course, that there are no other observables which commute with the 
Hamiltonian. 

f 

| ■ EXERCISE 6.4. Consider a dynamical flow on the two-dimensional unit 
I square, 0 < p < 1 and 0 <q<U given by the equations of motion, 

| idp/dt) — a and (dq/dt) = \. Assume that the system has periodic 
| boundary conditions, (a) Show that this flow is ergodic. (b) If the initial 
! probability density at time, t = 0, is p{p,q, 0), compute the probability 
j density at time, t. 

| Answer: 

i 

(a) The equations of motion are easily solved to give 

p(t)=Po + at and q{t) = q 0 + 1, (1) 

I 

where po and qo are the initial momentum and position, respectively. 

If we eliminate the time t, we obtain the phase space trajectory, 

P =Po + ot(q — q 0 ), on the square surface. If a is a rational number, j 
I a = ( m/n ) (m and n integers), then the trajectory will be periodic and I 

repeat itself after a period, n. If a is irrational, the trajectory will be 
dense on the unit square (but will not fill it)). A trajectory is shown in 
j the accompanying figure. j 
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When a is irrational, the system is ergodic. Let us show this 
explicitly. Since the phase space is periodic, any integrable function, 
f(p,q), can be expanded in a Fourier series, 

OO 00 

/(?,<?)= E Y. A ^ e2 ” {m,+np) - (2) 


We wish to show that the time average and the phase average of the 
function, /(p, q), are equal for a irrational. The time average is given 
by 


tf> = r“£r 


r [ o+T 


dt E E ^ e 2ni[m(q 0 +t)+n(p 0 +at)} 

m =— oo n=— oo 

1 00 / 00 / 

= A 00 + lim - V V A mn e 2lTi[miqo+to)+n{po+ato ^ 

’ T— +oo / « ■ ^ ' 


m=—oo n =— oo 


X 


( e 2ni(m+n)T _ | 

27ri(m + an) 


(3) 


The primes on the summations indicate that the values m = 0 and 
n = 0 are excluded from the summation. For irrational values of a, 
the denominator can never equal zero. Therefore 

(f)r = ^o,o- (4) 


Similarly, we can show that 

(/),= [ [ dpdqf{p,q) =A 0>0 . (5) 

Jo Jo 

Hence, the system is ergodic (note that dpdq = dpodqo, so area is 
preserved). 
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(b) The probability density satisfies periodic boundary conditions, so we 
can write 

00 oo 

P(P,9,0= £ £ Pn.Aty^e^”- (6) 

m =— oo n=— oo 

It is easy to show that p m , n (t) = p m ,n( 0)e 2m< ' m+an ^ t . Next note that 
Jo dqe llximq = and e2lTimq = <%)• Using these relations, 

we find 

P(p,<l,t) = p(p + at,q + t,0). (7) 

From Eq. (7) we see that ergodicity is not sufficient to cause a system 
which initially has a nonstationary distribution (localized on the 
energy surface) to approach a stationary state (spread throughout the 
energy surface). The probability density in Eq. (7) does not change 
shape with time, but simply wanders intact through the phase 
space. In order to approach a stationary state the system must be 
‘mixing’. Conditions for mixing flow are discussed in special topics 
Section S6.D. 


6.D. THE QUANTUM PROBABILITY DENSITY 
OPERATOR 

For quantum systems, the phase space coordinates do not commute so we 
cannot introduce a probability density function directly on the phase space. 
Because of the noncommutivity of phase space coordinates, we cannot 
simultaneously know the values of all the phase space coordinates. Instead we 
will introduce a probability density operator or density operator as it is 
commonly called. The density operator contains all possible information about 
the state of the quantum system. If we wish we can use it to construct the 
Wigner distribution, which is a function that reduces to the classical probability 
density in the limit where Planck’s constant goes to zero. 

The probability density operator p{t) (we shall call it the density operator), 
contains all possible information about the state of the quantum system. It is a 
positive definite Hermitian operator. Given that we know the density operator, 
p{t), for a system, we can use it to obtain the expectation value of any 
observable O at time t. The expectation value is defined as 

(< 0(t))=TrOp(t ), (6.47) 

where Tr denotes the trace. The density operator is normalized so that 


Tr p(t) = 1. 


(6.48) 
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In Eqs. (6.47) and (6.48), the trace can be evaluated using any convenient 
complete set of states. For example, we could use the eigenstates of the 
operator, O, or any other Hermitian operator, A, which may or may not 
commute with O. We will let (jo,)} and {|a,)} denote the complete orthonormal 
sets of eigenstates of the operators, O and A, respectively, and let {o,} and {a,} 
be the corresponding sets of eigenvalues (0\oi) = o,|o,) and (A|a,) = «,|a,)). 
For simplicity we use Dirac notation (cf. Appendix B). The trace can be 
evaluated in either of these basis. Thus, we can write 


( 0 ( 0 ) = ^2oi(oi\p(t)\oi) ='^2Y^( a i\d\ a j)( a j\p( t )\ a i), (6.49) 

i i j 

where o, = (o,-|D|o,-) and we have used the completeness relation, 
Y^i\ a i)(ai\ = 1, where 1 is the unit operator. The diagonal matrix element, 
(oi\p(t)\oi)((ai\p(t)\ai)), gives the probability to find the system in the state 
|o,)(|a,)), at time t. The set of numbers, (aj\p(t)\ai), forms a matrix 
representation of the density operator (called the density matrix ) with respect 
to the basis states, {)«,)}. The density matrix is a positive definite Hermitian 
matrix. The off-diagonal matrix element, {<>j\p(t)\ai) for i ^ j, cannot be 
interpreted as a probability. 

The introduction of a density operator allows a more general description of a 
quantum system than does the Schrodinger equation. As we shall see, it can also 
be used to describe the equilibrium and near equilibrium states of a many-body 
system. To see this it is useful to distinguish between “pure states” and “mixed 
states.” Consider a quantum system in the state \ip(t)) which evolves according 
to the Schrodinger equation, 


= (6 .50) 

where H is the Hamiltonian operator, and h is Planck’s constant. The density 
operator which describes this “pure state” is simply 

p(t) = \i/j(t))(i/j(t)\. (6.51) 

A “mixed state” is an incoherent mixture of states \4>i(t)): 

p{t) = «’(0I. ( 6 ' 52) 


where p t is the probability to be in the state \i>i(t)), and the states | ^i(O) each 
satisfy the Schrodinger equation. Equilibrium and near-equilibrium states of 
many-body systems are of this type. 
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Using the Schrodinger equation, the equation of motion of the density 
operator is easily found to be 

= (6-53) 

where [H,p(t)\ is the commutator of the Hamiltonian, H, with p(t), and the 
operator L = \ [H], is proportional to the commutator of H with everything on 
its right. The operator L is the quantum version of the Liouville operator and 
is a Hermitian operator. Equation (6.53) is called the Liouville equation 
and gives the evolution of the state of the system (in the Schrodinger picture). 
If the density operator is known at time t = 0, then its value at time t is 
given by 

p(t) = e - ,U p(! 0) = 0) C +(W»> (6.54) 

If we substitute Eq. (6.54) into Eq. (6.47) and use the invariance of the trace 
under cyclic rotation of operators, then Eq. (6.47) takes the form 


(0(t)) = Tr O(t)p(0), 

(6.55) 

where 


d(t) = e +mfit d(0)e- {i/h)fit . 

(6.56) 

Thus the operator, 0, obeys a different equation of motion, 


-i^ = \[H,d(t)}=Ld(t), 

(6.57) 


which is different from that of the density matrix. Equation (6.57) gives the 
evolution of the system in the “Heisenberg” picture. 

It is often convenient to expand the density operator in terms of a complete 
orthonormal set of eigenstates {|E,)} of the Hamiltonian, H, where E t is the 
eigenvalue corresponding to eigenstate | £,•). If we note the completeness 
relation Yhi |£j)(£i| = 1, then Eq. (6.54) takes the form 

m = E E< £ ^( 0 )i^>^ (,/B,(6 - £ ' ) 'i £ -')^i- < 6 - 58 ) 

* j 

From Eq. (6.58), we see that a stationary state, p s , occurs when all off-diagonal 
matrix elements (E, |p(0)|E y ) with i ^ j, of p(0) vanish of E t ^ Ej. Thus, for a 
state with no degenerate energy levels, the stationary state, p s , must be 
diagonal in the energy basis. This can only happen if p s is a function of the 
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Hamiltonian, 


Ps =f(H). (6.59) 

For a system with degenerate energy levels, one may still diagonalize both p 
and H simultaniously by introducing additional invariants of the motion, I 
which commute with each other and with H. Thus, in general, a stationary state 
will be a function of all mutually commuting operators, H,I \, . . . ,/„, 

Ps=f(H,ii,...,I n ). (6.60) 

For systems which approach thermodynamic equilibrium, the stationary state 
may be an equilibrium state. 


■ EXERCISE 6.5. Consider a harmonic oscillator with Hamiltonian, 
H = ( 1 /2m) {p 1 +\ mup-x 1 ). Assume that at time t — 0 the oscillator is a state 
described by the density operator, p(0) = h\/ab{e~ a ^ L e~ b P 2 + e~ b P 2 e* 0 * 1 ), 
where a and b are constants with the dimensions of inverse length squared 
and inverse momentum squared, respectively, (a) Compute the probability to 
find the particle in the interval jc — > x + dx at time t — 0. (b) Write the 
Liouville equation in the position basis, (c) Compute the probability to find 
the particle in the interval x x + dx at time t. 

Answer: 

(a) The probability to find the particle in the interval x — > x + dx at 
time t = 0 is (jr|p(0) |jc)dUr, where |*) is an eigenstate of the 
position operator x. We will use the notation p x \ x { 0) = (x: / |p(0)|x). 
Then 


= hVab(e + e-°^)(Ae- b ?V) 

Vab 


2n 


if* +e~° x ‘) J“ 


dpe~ 


( 1 ) 


1 !<*' -a* 2 . -—ax 2 


= 2 V 7T (e 


+ e aAr ^)expi 


(. x'-xf 
4 bti 1 


where we have used the completeness relation, dp\p)(p\ = 1, for 
momentum eigenstates and the conventions of Appendix B. The 
probability to find the particle in the interval x — > x + dx is p x , x (0)dx, 
where 



Px,x(0) = 


( 2 ) 
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(b) The Liouville equation in the position basis is 

dPx'M 1 


ih- 


dt 


dx! , ((x?\p 2 \x!')p x ?'j C (t) - p x ',x"(t)(x"\p 2 \x)) 


2m, 

-\-^muj 2 (x' 2 -x l )p x >, x (t) 


1 


4m7rh 


dp 


dx"p 2 (e l<J> /^ ^ p x », x (t ) 


- p,A tyww-’)) + V 2 - £)Px'At) 


-h 

4mn 


dp 


dx" 


d : 


dx" 1 


9 i(p/h)( X '-x") 


Px",x(t ) 




+ ] -muj 1 (x’ 1 -x 2 )pxjt). 


(3) 


If we now assume that p x >, x (t) — > 0 as x! — * oo or x — > oo, then we 
can integrate by parts in Eq. (3) and obtain 

:r.dp x >,x(t) h 2 (d 2 p x ,, x {t) d 2 p x ', x {t)\ 1 2(V 2 t 2 n _ (f] 

~dt = 2m \ dj^ dA~) + 2 mW {x ~ X )a '* W 

(4) 

(c) To find p x , x (t), let us first solve the Liouville equation in the basis of 
eigenstates of the Hamiltonian, H = (l/2m)p 2 + \mu) 2 x 2 . From 
Exercise 5.8, we see that the Hamiltonian has eigenvalues 
E n = hu(n + and eigenstates | n), (H\n) = E n \n)), which in the 
position basis are 


(x\n) 


1 /muj\ l / 4 1 
2 n n\\7rh) 


™ y \ p mu)x 1 1 2h 



n 


(5) 


where H n (y/(mu/h)x) is a Hermite polynomial. The Liouville 
equation in the basis of eigenstates | n) is 


m dpn ' d f = fko(n' - n)p n 


where p n ',n{t) = (n'\p(t)\n). The solution to Eq. (6) is 
Pn’,n(t) = e-W-tfrA 0). 


(6) 

(?) 
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Let us now note that p n ^ n (0) = J_^^o J-°i^o(« / K)^' ) x 0 (0)^o|n>. 
Then the density matrix, p x \ x {t), can be written 


00 00 poo 

■W = EE 

n != 0 n = 0 


fhf. 


dxL 


=n n=n J — oo 


1*00 

J— 00 


( 8 ) 


If we use the initial condition in part (a), the probability density, 
p x , x (t), can be written 


00 oo poo 


i j~ 00 00 

v rt '=0 «=0 J 




(•*0 “ -*o) 2 \ 


(9) 


x (e-< + e~<) exp l - 


Abtr 


Let us now use Eq. (5) and note the identity in Eq. (10) of Exercise 
5.8. With this we can write 

00 < !?> 

= (^) v5We”" ( ' 2+ ^ ,/ “ 

n — 0 

X exp (~ -jffW + *o ~ 2*o*e ,<J ')) , (10) 

where f(t ) = i'e~ ,w '/2sin(utf). If we use Eq. (10) to perform the 
summations in Eq. (9), then we are left integrals over x'q and *o which 
can be performed explicitly. After considerable algebra, we find 

(in 


t/^Re 

1 p -a*/B{t) 

V 7 r 

|y*« J 


where Re denotes the real part and 
B{t) = cos 2 (a jt) + 


2/, , a -2' * 


, _ _sin 2 (u;f) + / cos(u;?)sin(u;t). (12) 

bm z uj z mu> 


■ EXERCISE 6.6. An ensemble of silver atoms (each with spin {) is 
prepared so that 60% of the atoms are in the S z = +| eigenstate of S z and 
40% of the atoms are in the S x = — § eigenstate of S x (S x and S z are the x and 
z components of the spin angular momentum operator), (a) Compute the 
density matrix at time t — 0 in the basis of eigenstates of S z . (b) Assume that 
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the silver atoms sit in a magnetic field, B = Bq y, and have a magnetic 
Hamiltonian, H = pS • B, where p is the magnetic moment of a silver atom. 
Compute the density matrix at time t (in the basis of eigenstates of S z ). (c) 
Compute (S z (t)) at time t — 0 and at time t. 

Answer: 

(a) Let \i±) denote the eigenstates of Sk(k = x,y,z) with eigenvalues 
±(/i/2) so (Sjt|&±) = ±{h/2)\k±)). The density operator at time t = 0 
is the mixed state 


P(°) = ^k+)(z+l+^l*-)(*-l- (!) 

Now note that the matrix representation of the components S x ,S y , and 
S z in the basis of eigenstates of S z are 




_h(o -i 
v “ 2 l i 0 


and S z 


h ( 1 0 

2 \0 -1 

(2) 


The eigenstates of S x and S y , in the basis of eigenstates of S z are 



respectively. The eigenstates of S z in the basis of eigenstates of S z 
are 


f(z+\z+)\ 

\(z-\z+)J 



\(z-\z-)j 



(4) 


Using these results we find the initial density matrix in the \z±) 
basis, 


pW-Mj j)- (5) 

(b) The Hamiltonian is H = kfj,hB 0 (\y + )(y + \-\y_)(y_\) (this is its 
spectral decomposition). If we let p y++ = (y + |p|y + ), then the 
Liouville equation for various matrix elements of the density matrix 
in basis |y ± ) is given by 


.9p y ++(t) „ ; dpy+-{t) .dp y -+(t) 

— W~ - 0 ’ -PBoPy + -(t), 


= - pB 0 p y - + (t ) and 


- dp y (t) 

dt ~ ’ 




( 6 ) 
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Now note that the initial density matrix in the basis |y±) is given by 

/>(»)= (i 

V 10 

Using Eq. ( 7 ), we can solve Eqs. (6) 
time t in the basis |y±). We find 

I 3— 2i _+i nB 0 t 
\ 10 e 

We now can transform Eq. (8) to the 

( i+ToCOs(nB 0 t)+±sin(nB 0 t) 

P y ^ sin(/xZ?oO — 5 cos(fiBot) 

( 9 ) 

(c) The average z-component of spin angular momentum at time, t = 0 
is 

«0))=Tr^(0)=lTr(‘ "*) (10) 

The average z component of spin angular momentum at time t is 
3 1 

(S z (t)) = —hcos(fiBot) + -hsin(fiB 0 t). ( 11 ) 


( 7 ) 


and write the density matrix at 


3+2i „—iiiBat 
10 c 

1 

2 


( 8 ) 


basis | z±) to obtain 

Yq sin(/i 5 0 t) — | cos(fiBot) ^ 
^-^cos(fiB 0 t)-\sin(fiB 0 t), 


► SPECIAL TOPICS 

► S6.A. Reduced Probability Densities and the BBGKY 
Hierarchy [2, 5, 17] 

The /V-particle probability density, p(X N ,t), contains much more information 
than we would ever need or want. Most quantities we measure experimentally 
can be expressed in terms of one-body or two-body phase functions. One-body 
phase functions are usually written in the form 

O^x") = f>(X0, 

i=l 


( 6 . 61 ) 
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and two-body phase functions are written in the form 

N{N- 1)/2 

Of 2) (X A ')= Y, 0(X lt Xj), (6.62) 

i<j 


An example of a one-body phase function is the kinetic energy, pj /2/w, °f 

an A-particle system. An example of a two-body phase function is the potential 
energy, y(|q. — q^|), of an TV-particle system. To find the expectation 

value of a one-body phase function, we only need to know the one-body 
reduced probability density. Similarly, to find the expectation value of a two- 
body phase function, we only need to know the two-body reduced probability 
density. 

The one-body reduced probability density is given by 


* 

pi(Xi,0 = 


dX 2 • • • dX N p(X i, . . . ,Xw, ?)) 


(6.63) 


where p(X \, . . . , X N , t) = p(X N , t). The 5-body reduced probability density is 
given by 


Ps (Xl , . . . , X s , 7) 


dX s+l . . .dX^ p(X 1? . . . ,X N ,t). 


(6.64) 


If the probability density, p(Xi, . . . ,X N ,t), is known at time t, then the 
expectation value of the one-body phase function at time t is given by 


(°(1) W> = 22 } • • • } d X > • • • d X " 0 ( x i)p( x ". f) = N | dX, 0(X, , t)p , (X, , t) . 

(6.65) 


Similarly, the expectation value of the two-body phase function at time t is 

N(N- 1)/2 


< 0 ( 2 )«>= 22 f- ••frfx 1 ...dx A ,o(x i ,xxx w ,/) 

I <^7 * 


KJ 

N(N - 1) 


(6.66) 


dX l dX 2 0(X u X2,t)p 2 (X u X2,t). 


In Eqs. (6.65) and (6.66), we have assumed that the probability density 
is symmetric under interchange of particle labels if the Hamiltonian is 
symmetric. 

The equation of motion of p s (Xi, . . . ,X S , t) can be obtained from the 
Liouville equation. It is convenient to first introduce another quantity, 
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F S (X i , . . . , X 5 , t ), defined as 

F»(Xi, . . . , X„ I) = V s [ • • ■ f dX s+1 . . . X«, (), (6.67) 

and 

F«(X, , . . . , X N , t) = V N p(X , , . . . , X„, t). (6.68) 

Let us assume that the evolution of the system is governed by a Hamiltonian of 
the form 


N 2 N(N-l)/2 


i= 1 


(6.69) 


»<; 


where 0(|q ; - — q ; -|) is a two-body spherically symmetric interaction potential 
between particles i and j. The Liouville operator is 


N _ rs N(N- 1)/2 

i ' , = -'E-'r + ( £ % 


KJ 


(6.70) 


where 


6 A 

* 5q/ap ; - 


(6.71) 


and fa — 0(|q, — q ; |). If we integrate the Liouville equation, (6.27), over 
X J+ i , . . . , X N and multiply by V s , we obtain 


OF £ 
dt 


+ iL s F 


- v ’i 


dX s + 1 • • • dXpf 

N 


-£M+ £ ®«+ £ ©« 

i— $+1 " i<s,s+l<j<N s+l<k<l 


(6.72) 


x/(X,,...,X Wl f). 


If we assume that p(Xi, . . . ,X N , t) — > 0 for large values of X„ then the first 
and third terms on the right-hand side of Eqs. (6.72) go to zero. One can see this 
by using Gauss’s theorem and changing the volume integration to surface 
integration. For a large system the contribution from p(Xi, . . . ,Xn,t) on the 
surface goes to zero. The second term on the right-hand side can be written in 
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the form 

V- [••• [</X,. l -w/Xv £ § iJ /(X u ...,X N ,t) 

i < s;s+i <j < N 

dX s+ iO l)S+1 1 • • • jdX* + 2 • * • dXup N (Xi, . . . ,X N , t) 
d x s+l 0j jS-i-i F (Xi, . . . ,X* + i,r). 

Equation (6.72) then becomes 

+ iL‘F s = J2 j dX s+Am F, +l (X, , . . . , X, +1 , 0- (6.73) 

For a fixed values of s we may take the limit N — ► oo, V — » oo, . such that 
v = V/N remains constant (this is called the thermodynamic limit) and Eq. 
(6.73) becomes 





^ + itF s = i £ J d X, +1 0 M+I F s+I (X, , . . . , X J+1 , t). (6.74) 

Equation (6.74) gives a hierarchy of equations of motion for the reduced 
probability densities F S (X i, . . . ,X N ,t). It is called the BBGKY hierarchy after 
authors Bogoliubov [17], Bom and Green [18], Kirkwood [19], and Yvon [20]. 
The most useful equations in the hierarchy are those for F\(X\,t) and 
F 2 (X u X 2 ,t): 


and 


dFi p! dF i 


[ dX 2 0, 2 F 2 (X,,X 2 ,t) 


(6.75) 


df 2 , /Pi d p 2 

dt \m dq Y m 




dX 3(613 + 023)f 7 3(Xi,X2,X3 


0 - 


(6.76) 


Notice that the equation of motion of F\ depends on Fi, the equation of motion 
for Fi depends on Ft,, and so on. This makes the equations of the hierarchy 
impossible to solve unless some way can be found to truncate it. For example, if 
we could find some way to write F2 (Xi,X 2,0 in terms of Fi(Xi,f) and 
F!(X 2 ,f)> then we could in principle solve Eq. (6.75) for the reduced 
probability density Fj(Xi,f). Equation (6.75) is called the kinetic equation. 
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► S6.B. Reduced Density Matrices and the Wigner 
Distribution [1 6, 21-25] 

For quantum mechanical system, the phase space coordinates of particles do not 
commute, and therefore it is impossible to specify simultaneously the position 
and momentum of the particles. As a result, it is also not possible to define a 
distribution function on the phase space which can be interpreted as a 
probability density. However, Wigner [16] was first to show that it is possible to 
introduce a function which formally analogous to the classical probability 
density and which reduces to it in the classical limit. 

Before we introduce the Wigner function, it is useful to introduce the idea of 
one- and two-body reduced density matrices. In quantum mechanics, as in 
classical mechanics, we generally deal with one-body operators, 

6(i) = X^(P, •>€!«), (6.77) 

i=i 

such as the JV-body kinetic energy operator, and we also deal with two-body 
operators, 


N(N- 1)/2 

0(2) = <!/)’ 
i<j 


(6.78) 


such as the interaction potential. The trace of a one-body operator, in the 
position basis, can be written 

(OdjWHTrd^M 

= Sj * 1 • • - j j jdx' N (x, x N \Oi 

x {x\, . . . ,x! N \p(t)\xx, . . . ,x N ) 

= N dx x ••• jdfrtf ^dx' l (xi\0\\x' l )(x! l ,x2, . . . ,x N \p(t)\x\, . . . ,x N ) 

= jdbci 

(6.79) 

where 6, = d(p,,q,') and 

(xil/9 (1) (0l*i) =N^dx 2 • • • Jfihc N (xf v x 2 , . ■ • ,x N \p{t)\x x , . ..,x N ) (6.80) 


is the one-body reduced density matrix. (We use the notation of Appendix B.) 
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The two-body reduced density matrix is defined in an analogous manner. The 
trace of a two-body operator in the position basis can be written 

<0 (2) W>=Trd ( V(<) 

N{N-i)/2 r r r r 

— ^ ^ I dx i • • • dxpj dx!-y $ dXpj(x \ , . . . ^ . . . iX^ 

icj ^ J J J 

x (x' 1 ,...,x / N jp(t)jx 1 ,...,x JV ) 

= N ^2 J dxi ■ ■■ J dx N | dx! l J dxf 2 (x u x 2 \d h2 \A^2) 
x {jt l ,j£ 2 ,x$,...,x N \p(t)^cu...,x N ) 

= i J dxi j dx 2 j dx\ dx' 2 (xi , x 2 \d\ t2 |*i , x' 2 ) (x[ , 4 |P( 2 ) (0 |*i , x 2 ) , 

(6.81) 


where = <9(p,-, p ; , q f , q,) and 

(xf v xf 2 \p {2) {t)\x u x 2 ) = N(N - 1) |^3 • • • dx N 

x (44^3, • • • ,*Ar|p(f)l*i> • ••»**> 


(6.82) 


is the two-body reduced density matrix. 

We can now introduce the one- and two-particle reduced Wigner functions. 
The one-particle reduced Wigner function is defined as 

/,(k, R, r) = J dr<A r (R + r - |p (1) (r)|R - 1), (6.83) 

and the two-particle reduced Wigner function is defined as 


/ 2 (k,,k 2 ,R,, R 2 ;f) = f [ <fr,dr 2 e ik ' V 

x (*' + ~ 2 ' y l/ } f2)( f )|Ri - Y’® 2 — y}' 

(6.84) 

Higher-order Wigner functions can be defined in a similar manner. 

In analogy to the classical distribution function, the one-particle Wigner 
function obeys the relations 

f -^ 3 /, (k, R, t ) = <R|p ( i) it) |R) = n(R, t ) , (6.85) 

J ^27 Tj 
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where «(R, t ) is the average number of particles at point R and time, t, and 

jdR/i(k,R,f) = <k|p (l) M|k) =«(k,f), (6.86) 

and n(k, t) is the average number of particles with wavevector, k, at time t. The 
Wigner function can be used to take phase space averages in the same way as 
the classical distribution functions. For example, the average current is defined 
as 


f dk 

(j(R,0) = — -yWi(k,R,0, 

(2tt) 


( 6 . 87 ) 


where h is Planck’s constant. However, the Wigner function can become 
negative and therefore cannot always be interpreted as a probability density. 

We can derive the equation of motion for f\ (k, R, t) in the following way. 
For a system with a Hamiltonian 


" = £ 


h 2 Jcf 

2m 


N(N- 1)/2 

+ £ V(q,.-q 

i<j 


(6.88) 


the equation of motion for the one-particle reduced density matrix is 


|<r,|p ( i)W|r2> = -£(V r , + V r; ) ■ (V ri - V r! )(r,|p (1) W|r2) 

- dr'{V(r 2 - r') - V(r, - r')](r 1 ,r'|p (2) (r)|r 2 ,r'), 

( 6 . 89 ) 


where V(r { - r 2 ) = (ri,r 2 |P(q,- - q,)^!, r 2 ). 

Let us now change to relative and center-of-mass coordinates, r = ri — r 2 
and R = ^(ri + r 2 ), respectively. We then multiply by e ikr and integrate over 
d r and find 


| dre ' kT I < R + \ i^‘))( r )i R - 1) 

= - 1 f d re' k r Vn • V r (R + T - \p m (f)|R - £ 

^-[K(R-f-r')-v(R + I-r')] 


x ( R + 5’ r 'IA2)( f )|R -£«■'). 


( 6 . 90 ) 
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In Eq. (6.90) we can integrate the first term on the right by parts to remove the 
derivative with respect to r. We can also introduce dummy variables into the 
second term on the right and make use of definitions in Eqs. (6.83) and (6.84) to 
obtain 


|/ 1 (k,R,r) = -£kV R / 1 (k,R,r) 


‘f4UV( R . r - >1) 
J (2tt) 3 J L V 2l dk J 


(2tt) 

x/ 2 (k,R;k',r';r). 


V[ R - r' +— — ) 
1 2idk) 


(6.91) 


Equation (6.91) is the quantum kinetic equation for the one-particle reduced 
Wigner function. If we rewrite it in terms of momenta p = Hk and p' = hk', it 
takes the form 


|/i'(p.r.o+~vr/;(p,r,o 


-sJs5>WK*- 

x/ 2 (p, r ; p',r';0> 


r '-||)- v '( R - r ' + I|) 


where 


(6.92) 


/„'( Pi , r, ; . . . ; p„, r„) = ^/„ Qp, , r,; . . . ; ^p„, r„ J . (6.93) 


We can now expand the potential on the right-hand side in powers of H and take 
the limit h — > 0. We then retrieve the classical kinetic equation. 

The Wigner function can be used to take the average value of a large class of 
ordinary functions of momentum and position but in some cases it will give the 
wrong answer. The average value of any quantity which is only a function of 
position or only a function of momentum can always be taken (this is easily 
seen from Eqs. (6.85) and (6.86)). However, only those functions which involve 
both position and momentum can be used for which the Weyl correspondence 
between the quantum and classical version of the operators holds. To see this, 
let us consider the classical function 0(p, q) of phase space variables p and q. 
We can find the quantum version of this function as follows. We first introduce 
its Fourier transform, with the equation 


0(p, q) = idadrid(a, ,)*«•**■«) . 


(6.94) 
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Matrix elements of the quantum operator corresponding to 0(p, q) can be 
obtained from the Fourier transform, ), via the equation 


(r'|d|r") 


l dad if 0(<r, ij)(r'|e'*”^' , ®|r">, 


(6.95) 


where p and q are momentum and position operators. If the classical function 
0(p, q) and matrix elements of the corresponding operator O are related by the 
above procedure, then the expectation value of O is given by 

(°(t)) =| jrfp^rO(p,r)/ 1 '(p,r,f). (6.96) 

There are some cases for which the Weyl procedure does not give the correct 
correspondence between classical and quantum operators (such as the 
commutator [p, r]_, the square of the Hamiltonian H 2 , the square of the 
angular momentum L 2 , etc.) and the Wigner function gives the wrong result. 
Then it is necessary to introduce more general quantum phase space 
distributions which may in general be complex functions [25]. 


■ EXERCISE 6.7. Compute the Wigner function for a system with a 
density operator, p = hVab(e~ a ^ e~ b ^ 2 + e~ b ^ 

Answer: First compute the matrix element of the density operator in the 
position basis, 

(i) 

Now let x\ = X + (jc/ 2) and jcj — (jc/ 2). The Wigner function is then given 
by (after some algebra) 

/(*,X)=J“ dxe^^X + ^plX-j) 


4 abh 2 ( —aX 2 \ ( —bk 2 h 2 \ ( 2abh 2 kX 

exp (rT^J exp VTT^j C0S lTT^ 

(2) 

A plot of the Wigner function is shown here for a = 1 and bh 2 = 2. Note that 
there are some regions where it becomes negative, indicating that it is not a 
probability density. 
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^ S6.C. Microscopic Balance Equations [26] 

For quantum systems with short-ranged interactions and long-wavelength 
inhomogeneities, we can derive microscopic balance equations for the particle 
density, momentum density, and energy density in a manner analogous to the 
derivation for classical systems. We first note, however, that the position and 
momentum operators satisfy the commutation relations, 

[p /, P /]_ = 0, [q ( , q/3- = 0, and [q,, p y]_ = ihSy. (6.97) 

The commutator of the momentum operator p, with an arbitrary function of 
coordinate operators is 


\PhF(q u ...,4v)]_ = (6.98) 

while the commutator of q, with an arbitrary function of momenta 
G (Pi,---,P n) is 


[q,.,G(Pi,...,pJ]_ = i^— . (6.99) 

Let us assume that the dynamics of the system is governed by a Hamiltonian of 
the form 


W 



(1/2)V(V-1) 

^(14- -q, -I)- 


i<J 


( 6 . 100 ) 
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Then from Eq. (6.57) the equation of motion for the operator q, is 


dt 



Pi 

m 


The equation of motion of p, is given by 


dP; 

dt 


E 


'dy(lq.--q/D ' 

dq,- 


E*«- 


(6.101) 


( 6 . 102 ) 


These equations have the same form as the classical equations. 

For quantum systems, the microscopic expressions for the densities must be 
Hermitian in order to be observable. Thus, we must have symmetrized 
expressions for operators which involve both momentum and position. Using 
Eq. (6.57) and the above equations, we can show that the balance equation for 
the particle number density is given by 


dt 


«(q";R) = -V R -J' , (p A ',q ,v ;R), 


where the particle density is defined as 

n(q w ;R) = X; i 5(q j -R) 
/=! 


(6.103) 


(6.104) 


and the particle current density is defined as 


J"(p w ,q A ';R)=^ 


1=1 L 


^(4-R) + tf(4-R)^ 


(6.105) 


As usual, we let denote the set of momenta p^ = ( Pi , • • . , p#) an d we l et q W 
denote the set of positions q^ = (qfi, . . . , q^). 

The balance equation for the momentum density takes the form 

m|j"(p N , q"; R) = -V R • J^p", q"; R), (6.106) 

where the momentum current tensor, J^p^, q^; R), is defined as 


j^pWiR) =^E[p,Mq f - R) + (Mi- - R )P/) r 

i=l 

+ Pi«(q,- “ R )P, + <5(q; ~ R )P/P/] 

+?EE(»- 4) •*■««(%-*)• 

¥i 


(6.107) 
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In Eqs. (6.106) and (6.107) the notation 

V R ■ - R)P,) r = P f V R • 6(4' - R)p, (6.108) 


has been used. 

Finally, the balance equation for the energy density can be written 

|*(p A ',q A ';R) = -V R - JV'.qftR), (6.109) 

where the energy density is defined as 

A(p«,q w ;R) = i^[M(4-R) + «(4-R)4] (6.110) 


1= 1 


with 


14-41) 


j# 


(6.111) 


and the energy current density is defined as 


jV^r) =^E 


4^6(4-R) + W(4-R)| 


/=i L 

+ |tf(4-R)4 + «(4-R)®4 


, N N 

+z££ 


r (p« + p /) 


¥J 


m 


Fy(4 - 4) 6 (4 - R) 


+ < 5(q,-R)(q,-q ; .)Fy 


(P, + Pj) 


m 


( 6 . 112 ) 


To obtain Eq. (6.1 12), one must use the fact that the center-of-mass coordinates 
commute with the relative coordinates. 


► S6.D. Mixing Flow [7-9] 

Ergodicity is not a sufficient condition on a dynamical flow to ensure that a 
probability distribution that initially is localized on an energy surface will 
spread, in a course grained manner, throughout the energy surface. Spreading 
throughout the energy surface is the type of behavior that we need for a 
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Newtonian system to approach a state that might be used as an equilibrium 
state. A type of flow that does have this feature, and therefore might exhibit 
irreversibility in a course-grained sense (because of the unstable nature of its 
dynamics!) is mixing flow. Mixing flow is chaotic and causes any initial 
probability distribution to spread throughout the energy surface. Mixing flow is 
ergodic, but ergodic flows are not always mixing (the exceptions, however, are 
rare). 

A system is mixing if, for all square integrable functions, f(X N ) and g(X N ), 
on the energy surface, S E , we obtain 


1 


lim 

t->±oo^2(E) 


f(X N )g(X N (t))dS E = JSe 


Lf(X N )ds E L g (x N )ds E 


Se 


(£(£))' 


(6.113) 


where ^2(E) is the structure function defined in Section 6.C. 

Equation (6.113) ensures that the average value of a dynamical function 
f(X N ) will approach a stationary value in the limit t — > ±oo. Let 
g(X N ) = ^(X^), where p(X N ) is a nonstationary probability density. Then 


(/«> 


Se 


f{X. N )p{X N (t))dS E 


.±ccJ2(E) ) Sc 


f(X N )dS E . 

JSf. 


(6.114) 


Thus, f(t) approaches an average with respect to the stationary state p s — 

[£(£)]■'■ 

It is important to emphasize that mixing gives a coarse-grained and not a 
fine-grained approach to a stationary state. The average of the probability 
density becomes uniform, but the probability density itself cannot because ol 
Eq. (6.24). The probability density does not change in a neighborhood of a 
moving phase point, but it can change at a given point in space. We can 
visualize this if we consider a beaker containing oil and water. We add the oil 
and water carefully so that they are initially separated, and we assume that they 
cannot diffuse into one another. We then stir them together (Fig. 6.2). The local 
density and the total volume of the oil remain constant, but the oil will get 
stretched into filaments throughout the water. Therefore, on the average, the 
density of the oil will become uniform throughout the beaker. If we are careful 
enough, we can also stir the oil back into its original shape. Therefore, while we 
get an approach to uniformity, the whole process can be reversed. However, as 
we shall see, mixing does lead to the appearance of random behavior in 
deterministic systems and coarse-grained irreversibility. 

The meaning of Eq. (6.1 13) may therefore be summarized as follows. Let A 
and B be two finite arbitrary regions on the surface S E . Let us assume that all 
phase points initially lie in A. If the system is mixing, and we let it evolve in 
time, the fraction of points which lie in A or B in the limit t — > ±oo will equal 
the fraction of area S E occupied by A or B, respectively. 
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Fig. 6.2. Stirring oil and water together leads to uniformity, on the average. 


An example of a discrete dynamical system which is mixing is the Baker’s 
map. The dynamics evolves on a unit square, 0 < p < 1 and 0 <q< 1. The state 
points evolve in discrete time steps and change value in a discontinuous manner. 
The dynamical evolution is governed by an “alphabet” with two letters, 0 and 
1, and the set, {5}, of all possible infinite sequences of letters. Each infinite 

sequence has the form 5= (. . . ,S- 2 ,S-\,So,S\,S 2 ), where S* =(0 or 1) 

and k = (0, ±1, ±2, . . .), and corresponds to a state point in the two- 
dimensional phase space. Each sequence maps onto the unit square according 
to the rules, 


-°° t _j So , $-1 , S_2 


p = E^‘- , =t+ 


*=0 


4 + T + 


and 


„ _ 'sp C - Si ,£> S 

<7-2_> 2 -T + T + T + 


(6.115) 


*=i 


Thus, if all S* = 0, then p = q = 0, and if all S* = 1, then p = q = 1 since 
= 1* All other cases also lie on the unit square, 0<p<l and 

0<<7< 1. 

The dynamics is introduced into the phase space by means of the Bernoulli 
shift, U, which is defined so that USk = Sk +\ . That is, the operator, U, acting on 
a sequence, S, shifts each element to the right one place. The shift acting on the 
sequence, S, is equivalent to the following transformation (the Baker’s 
transformation) on the unit square: 


U{p , q) 


(2 P,\q), 0 <p<\ 

+ \<p< 1. 


The inverse transformations is 


r (5P.2?), o<?<i 

j(ip + i,2«-l), \<q<\. 


(6.116) 


(6.117) 
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The Jacobian of the transformation is equal to one. Therefore, the mapping, U, 
is area-preserving. Let us now look at how the probability density evolves under 
the action of the shift, U. Let po(p, q ) be the initial probability density and let us 
assume that it is continuous and smooth. After we allow U to act n times, we 
obtain 


Pn(p,q) = U n p 0 {p,q ) = Po (U n p, U n q ). 


More explicitly, 


pn+i(p,q) 


pn{\p,2q), 0 <q<\ 

p n (\p + \,2q- l), |<^<1 


(6.118) 


(6.119) 


The time evolution operator, U, is an example of a Frobenius-Perron operator 
[27] and [28]. 

The effect of the Baker’s transformation is to stretch an initial area element 
into filaments throughout the unit square, much as a baker does in kneading 
dough. Note that whenever S 0 = 0 the point corresponding to the sequence, S, 
will lie to the left of p = \ and when So = 1 it will lie to the right of p = {. 
Therefore, points corresponding to sequences with 0 and 1 distributed at 
random in the positions, S k , will be shifted to the right or left of p = \ by U at 
random. If initially the probability density is po(p,q) = 0 for 0 <p<\ and 
0<<?<1 and p{p , q) = 2 for \<p< 1 and 0 <q< i, then the probability 
density at times n = 1, 2,3, are shown in Fig. 6.3. 

As we have seen above, there is an element of randomness in the position of 
a point in the p direction. Let us look at the reduced probability density, 4>{p), in 
the p direction. The quantity 4> n {p)dp is the probability of finding a point in 
the interval p —> dp at ‘time’ n. It is defined as 4> n = J 0 dqp n (p,q). Using 
Eq. (6.119), (j) n +i{p ) becomes 


^, + i(p)=^(f)+^(§+0- ( 612 °) 

The reduced probability evolves in a Markovian manner. We can show that 
lim n _ +0o (j) n (p) = 1 and, therefore, that the reduced or coarse-grained probability 


d d 
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Fig. 6.3. Evolution of an initial probability under the action of the Baker’s map. 
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density approaches a constant. If we iterate Eq. (6.120), we obtain 

For a continuous and smooth function, (f>o(p), we obtain 

( 1-1/2" /.l 

dy(j>o[^ + y) = dy<f> 0 (y) = 1, (6.122) 

o v2" / Jo 

where we have let y = k2~ n . Notice that for n — > — oo, <j> n (p) will not approach 
a constant. However, a reduced probability density, defined in terms of the 
variable q, does. Therefore, the Baker’s transformation exhibits irreversibility in 
a course-grained sense. 


■ EXERCISE 6.8. Compute the trace of the Baker map, U n . 

Answer: Let us denote U n {p,q ) = (p n (p,^),^”(p,^)), where p n {p, q) and 



t f{p , q) are functions of p and q. The trace of U n can be written 

Trf/"= \'dp\'dp6(p-p"(p,q))6(q-cf(p,q)). (1) 

Jo Jo 

Before we evaluate Eq. (1), let us note the following property of the Dirac 
delta function. Let us consider a function, /(x), which has zeros at points xoy 
(where k = 1, 2, . . . ,Af), so that f(x 0 ,*) = 0. Then 


__ 8(x — xq ,*) 




where f(x) = (df/dx). 

We now can evaluate the trace of U n . Notice that the delta functions will 
give contributions when p n = p and q n = q. That is, for all 2” nth-order 
periodic points of the map, U n . First note that (dp n /dp 0 ) = 2" and 
(dq n /dq 0 ) = (1/2”). Thus, 


Tr U n = | 

Period n points ' 


= 1 - 


^P- 1 ! IP 1 " 1 = 2”(2” — l) -1 ( 1 — 

dpo | \dq 0 V 

l 00 / 1 \ n 

m=0 / 


(cf. Ref. 28). The trace of U n is equal to the sum of the eigenvalues of U n . 
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► S6.E. Anharmonic Oscillator Systems [4] 

The study of anharmonic oscillator systems has long been important in 
statistical mechanics in connection with the theory of heat transport in solids 
[29]. One would like to know the mechanism of heat conduction. If heat is 
added to a harmonic lattice and divided unequally between the normal modes, 
there is no way for the system to reach equilibrium because the normal modes 
do not interact. However, if slight anharmonicities exist in the lattice, it was 
expected that equipartition of energy would occur and that the system would 
thus reach equilibrium. 

In 1955, Fermi, Pasta, and Ulam [30] conducted a computer experiment 
intending to show this. They studied a system of 64 oscillators with cubic and 
broken linear coupling. They found that when energy was added to a few of the 
lower modes there was no tendency for the energy to spread to the other modes. 
This behavior is quite different from what one would expect if the anharmonic 
oscillator system were an ergodic system. Then one expects the system to reach 
a stationary state in which all states with the same energy would be equally 
probable, and one expects to see energy sharing among the modes. 

The type of behavior that Fermi, Pasta, and Ulam observed is now fairly well 
understood in terms of a theorem due to Kolmogorov [31], Arnold [32], and 
Moser [33] (commonly called the KAM theorem). The theorem states that for a 
system with weak anharmonic coupling (which satisfies the conditions of the 
KAM theorem), most of the energy surface will be composed of invariant tori 
and the system will exhibit behavior in many respects similar to that of an 
unperturbed harmonic oscillator system. The energy surface will not be 
metrically transitive. As the coupling is increased, however, the invariant 
regions of phase space break down and at some point one expects to see a sharp 
transition to chaotic behavior and something similar to equipartition of energy 
between the modes. (True equipartition requires ergodicity, and it is not clear 
that anharmonic oscillator systems are ergodic above the transition energy.) 

There is now a variety of nonlinear oscillator systems which have been 
studied and which exhibit a transition from stable to chaotic behavior as certain 
parameters are changed [4]. Henon and Heiles [34] studied the bounded motion 
of orbits for a system with two degrees of freedom governed by the Hamiltonian 

H = \{p\ +p\ + <7? + <l\) + <7i< 72 ~ 50i* (6.123) 

The trajectories move in a four-dimensional phase space but are restricted to a 
three-dimensional surface because the total energy is a constant of the motion. 
It is possible to study a two-dimensional cross section of the three-dimensional 
energy surface. For example, we can consider the surface #2 = 0 and look at a 
trajectory each time it passes through the surface with positive velocity p 2 > 0- 
It is then possible to plot successive points of the trajectory (q 2 = 0,p 2 > 0) in 
the pi, #i plane. If the only constant of motion is the total energy, E, then the 
points should be free to wander through the region of the pi,#i plane 
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Fig. 6.4. Henon-Heiles result for E = 0.08333. (Based on Ref 34.) 


corresponding to the energy surface. The motion we see will appear to be quite 
similar to ergodic motion. If there is an additional constant of the motion, the 
points will lie on a smooth curve in the p \ , q x plane. 

Henon and Heiles studied trajectories whose motion was governed by Eq. 
(6.123) for a variety of energies. The results are sketched in Fig. 6.4. For an 
energy, E = 0.08333, they found only smooth curves, indicating that to 
computer accuracy there was an additional constant of the motion. Each closed 
curve in Fig. 6.4 corresponds to one trajectory. The three points of intersection 
of lines are hyperbolic fixed points, and the four points surrounded by curves 
are elliptic fixed points [4]. However, at an energy E = 0.12500, the picture 
begins to break down (cf. Fig. 6.5). Each closed curve in Fig. 6.5 corresponds to 
one trajectory. The five islands correspond to one trajectory, and the random 
dots outside the closed curve correspond to one trajectory. At an energy of 
E — 0.16667, almost no stable motion remains (cf. Fig. 6.6). A single trajectory 
is free to wander over almost the entire energy surface. In a very small energy 
range the system has undergone a transition from stable to chaotic behavior. 
Additional studies of the Henon-Helies system [35, 36] have shown that 
trajectory points move apart linearly in the stable regions, whereas they move 
apart exponentially in the chaotic regions. 

The change from stable to chaotic behavior sets in rather abruptly. This has 
been understood in terms of an overlapping of resonances in the system. The 
Hamiltonian for a general anharmonic system with two degrees of freedom can 
be written in terms of action angle variables in the form 


H = H 0 {J X , J 2 ) + W (Ji u 02) 


(6.124) 
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Fig. 6.5. Henon-Heiles result for E = 0.12500. (Based on Ref 34.) 



Fig. 6.6. Henon-Heiles result for E = 0.16667. (Based on Ref 34.) 


by means of the transformation 

/ 2 jr.\ 1/2 

pi = and q t = ( ~ j cos 0,-. 

The function H 0 (Ji , J 2 ) has a polynomial dependence on the action variables J 1 
and J 2 (not merely a linear dependence as would be the case for a harmonic 
system) and no angle dependence, while V(J\,J 2 ,<j>i t (j> 2 ) depends on both 
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action and angle variables and is a periodic function of the angles. When A = 0 
the action variables will be constants of the motion, and the angles fa and fa 
will change in time according to the equations 

4>, = uJ i (h,J 2 )t + 4> ifi (6.125) 

for (i = 1 , 2 ) where 

= (6126) 

For the anharmonic case, the frequencies u>i(Ji,J 2 ) will be continuous because 
they depend on the action variables, even for two degrees of freedom. This 
continuous dependence on the action variables is quite different from a 
harmonic oscillator system where the frequencies, a;,-, are constant. 

For systems which satisfy the conditions of the KAM theorem (namely, 
small A and nonzero Hessian, det \d 2 Ho/dj i dj j \ i= 0 for (ij) = 1 and 2), only a 
very small region of phase space, the resonance regions, will exhibit chaotic 
behavior. The rest of the phase space will correspond to stable motion. If one 
tries to construct new action variables e f i which are constants of the motion 
when A 7 ^ 0 but small, one finds that this can be done for most of the phase 
space, except for the resonance zones. In the resonance zones, perturbation 
expansions for diverge. Let us construct a perturbation expansion [37] for 
the action variables ^ i to lowest order in A. Let us consider a Hamiltonian of 
the form 


H = Ho{J u J 2 ) + \^2 V nua2 (Ji,J 2 )cos{ni<t>i +n 2 <h) (6.127) 

«i,n 2 

and let us introduce the generator 


>&) = /l 01 + + n2 s\n{n\(j)\ + n 2 fa) (6.128) 

« 1.«2 

of a canonical transformation from variables fa to variables </, , 3>„ such that 
the variables are constants of the motion. Then 


Ji = 


dF 

dfa 


/«• + ^2 n i B n u n 2 cos(nifa + n 2 fa ) 
« 1,«2 


(6.129) 


for i = 1,2 and 


*i = 


dF 

Ofi 


(6.130) 
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for i = 1,2. If we substitute Eq. (6.129) into Eq. (6.127) and keep terms to 
lowest order in A (this requires a Taylor series expansion of Hq), we obtain 


H = H 0 (/ u / 2 ) + ^{(nitui +n 2 u 2 )B nun2 + AV nii „ 2 }cos(ni<£i +n 2 (j> 2 ). 

n\,ni 

(6.131) 


To lowest order in A, f x and f 2 will be constants of motion if we choose 


Then 


Bfi\ ) n 2 — 


—XV, 


n\,n 2 


(n \u)i + n 2 uj 2 ) ‘ 


(6.132) 


H = H 0 (f l J 2 ) + O(\ 2 ) (6.133) 


and 


- y i + 


XV, 


ni,n 2 


n\,n 2 


(fliUJi + n 2 u 2 ) 


cos(ni0i 4- n 2 (f ) 2 ) 4- 0(X 2 ) 


(6.134) 


Note, however, that since and oj 2 are functions of J\ and J 2 , there are values 
of J\ and J 2 for which the denominator (mwi + n 2 u> 2 ) can be zero, and the 
perturbation expansion becomes meaningless. Indeed, as long as 


\niOJi + n 2 u> 2 \ < XV nun2 , (6.135) 

the perturbation expansion will diverge and ) is not a well-behaved invariant. 
This region of phase space is called the resonance zone and (niuji 4- n 2 u> 2 ) = 0 
is the resonance condition. It is in the resonance zones that one observes chaotic 
behavior. 

If the regions of phase space which contain resonances, and a small region 
around each resonance, are excluded from the expansion for then one can 
have a well-behaved expression for Thus, one can exclude regions which 
satisfy the condition 

[n\u>\{J\,J 2 ) 4- n 2 uj 2 {J\,J 2 )\ <$C XV nun2 . 

For smooth potentials, V nij „ 2 decreases rapidly for increasing n x and n 2 . Thus 
for increasing n\ and n 2 , ever smaller regions of the phase space are excluded. 

Kolmogorov, Arnold, and Moser proved that as A — > 0 the amount of 
excluded phase space approaches zero. The idea behind their proof is easily 
seen in terms of a simple example [38]. Consider the unit line (a line of length 
one). It contains an infinite number of rational fractions, but they form a set of 
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measure zero on the line. If we exclude a region 

/m £ \ m /m e\ 

\n n 3 / ~~ n ~ \n n 3 / 

around each rational fraction, the total length of the unit line that is excluded is 

5Z 5Z ( „3 ) “ 2e H „2 ~ 1 73t°- 


n=\ m= 1 


Thus, for small A, we can exclude the resonance regions in the expansion of 
and still have a large part of the phase space in which J x is well-defined and 
invariant tori can exist. 

Walker and Ford [37] give a simple exactly soluble example of the type of 
distortion that a periodic potential can create in phase space. It is worth 
repeating here. They consider a Hamiltonian of the type 


H = H 0 (J u J 2 ) + A7 1 7 2 cos(20 1 - 2 fa) = E, (6.136) 

where 

H 0 (Ji,h) = J\ +h~ J\ - 3 J,J 2 + Jl (6.137) 

For this model, there are two constants of the motion, namely, the total energy 
H = E and 


1 = J X +J 2 . (6.138) 

Therefore, we do not expect to see any chaotic behavior for this system. 
However, the unperturbed phase space will still be distorted when A ^ 0. The 
frequencies u>,- for this model are given by 

— 2J\ — 3J 2 (6.139) 


-3/1+272. (6.140) 

If we want the frequencies to remain positive, we must choose 0 < /i < ^ and 
0 < / 2 < pj and, therefore, E < 

Let us plot trajectories for the Walker-Ford case (q 2 = 0 ,p 2 > 0) (note that 
<Ji = (2JiY^ 2 cos<f)i and p t = — (2/,-) 1//2 sin0j). We find that for A = 0 the 
trajectories trace out concentric circles in the pi,qi plane. When the 


and 


dHn 


u 1 = 


dJi 


UJ 2 


dHo 

dJ 2 


= 1 
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Fig. 6.7. Cross-section of the energy surface for the Hamiltonian, H = J\ + 
J 2 - J\ — 3/1/2 + J% + A/1/2 cos (20i — 202) = E. There is no chaotic behavior. 
(Based on Ref. 37.) 


perturbation is turned on (A 7^ 0), the phase space becomes highly distorted. If 
we set 02 = § 7r (this means <72 = 0,p2 > 0) and substitute Eq. (6.138) into Eq. 
(6.136), we obtain the following equation for the perturbed level curves: 

(3 + A cos 20i )/ t 2 - (5/ + XI cos 20i )/i + 1 + 1 2 — E = 0. (6.141) 

They are sketched in Fig. 6.7. Most of the phase space is only slightly distorted 
from the unperturbed case. However, there is a region which is highly distorted 
and in which two elliptic fixed points (surrounded by orbits) and two hyperbolic 
fixed points appear [4]. The fixed points occur for values of /, and 0/ such that 
J\+h = (0i — 02) = 0. If we use the fact that /, = —dH/dfa and 
0i = dH/dJi and condition 02 = 3tv/2, we find that the hyperbolic orbits 
occur when 


/i = m — and (0> i-02)=O and 7 r, (6.142) 

(1 - A) 

while the elliptic orbits occur for 

J \ = [1 ( 0 i -02) = | and y- ( 6 - 143) 

The first-order resonance condition for this model [cf. Eq. (6.135)] is 
2uj\ — 2u>2 = 0 or, from Eqs. (6.139) and (6.140), J\ = 5/2. Therefore, from 
Eqs. (6.142) and (6.143) we see that the distorted region of phase space lies in 
the resonance zone. 
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In general, for a Hamiltonian of the form 

H — H 0 (J u J 2 ) + AV(7i,y 2 )cos(ni0i + n 2 cf) 2 ) (6.144) 

there will be no chaotic behavior because there is always an extra constant of 
motion, 


I = n 2 J\-n\J 2 . (6.145) 

However, when the Hamiltonian is of the more general form given in Eq. 
(6.127), the extra constant of motion is destroyed and the resonance zones 
become more complicated and begin to overlap. When this occurs one begins to 
see chaotic behavior. 

Walker and Ford study the example 

H = H 0 (J u J 2 ) + Ai/iy 2 cos(20i - l(f) 2 ) + A 2 7 1 7 2 3/2 cos(20i - 3 0 2 ), (6.146) 

where an extra cosine term has been added to Eq. (6.136). For this model there 
is no longer an extra constant of motion. There are two primary resonances 
which grow as Ai and A 2 are increased. In Fig. 6.8, we sketch their results. For 
low energies there is no chaotic behavior (to computer accuracy). However, as 
the resonance zones grow and begin to overlap, the trajectories in the regions of 
overlap become unstable and begin to exhibit chaotic behavior. In Fig. 6.8 the 
dots correspond to a single trajectory. 



Fig. 6.8. Cross section of the energy surface for the Hamiltonian, H = J\ + J 2 
~J] — 3/1/2 + / 2 + A 1 / 1/2 cos(2^>i — 2 fa) + A 2 /i J 2 cos(20i — 3<fo) = E. (a) Phase 
space trajectories below the energy of primary resonance overlap, (b) Phase space 
trajectories above the energy of primary resonance overlap. When primary resonances 
overlap, large-scale chaos occurs in their neighborhood. (Based on Ref. 37.) 
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Thus, from these simple examples we see that the chaotic, or ergodiclike, 
behavior of phase space for the anharmonic oscillator system appears to be 
caused by the overlapping of resonances. If the energy surface is filled with 
resonance zones, as is often the case, then we expect chaotic behavior to set in 
at very low energy. 

Anharmonic oscillator systems are a rather special type of system and 
their ergodicity has never been established, for obvious reasons. A completely 
different type of system is a system of hard spheres. For systems of hard 
spheres, ergodicity and mixing behavior have been established [39]. A 
proof that systems with Lennard-Jones types of potential are ergodic has 
never been given. However, when the number of degrees of freedom becomes 
large, the “regular” regions of the phase space appear to become relatively 
less important than the chaotic regions and statistical mechanics, which is 
built on the assumption that ergodicity appears to work perfectly for those 
systems. 

The chaotic behavior illustrated in this section is indicative of unstable flow 
in phase space. Orbits in the chaotic region which initially neighbor one another 
move apart exponentially and may move to completely different parts of the 
energy surface. If we start with an ensemble of orbits in some region of phase 
space and assign a probability distribution to them, the probability distribution 
will spread on the energy surface, and we will become less certain about the 
actual state of the system. Systems with unstable flow have the potential of 
exhibiting decay to thermodynamic equilibrium: An initially localized 
probability distribution can spread and, in a coarse-grained sense, can fill the 
energy surface. 


► S6.F. Newtonian Dynamics and Irreversibility [40, 41] 

The instability and chaos that we have described in the baker map, Eq. (6.1 16), 
and that we have illustrated in the Henon-Heiles system appears to be a source 
of the irreversibility seen in nature. One of the great paradoxes of physics is the 
fact that Newton’s equations are reversible, but much of nature evolves in an 
irreversible manner: Nature appears to have an “arrow of time.” There is a new 
field of statistical physics which finally is resolving this paradox [40-44]. The 
resolution of the paradox is most easily seen in the spectral properties of chaotic 
maps such as the baker map. Individual trajectories in chaotic systems move 
apart exponentially and become impossible to compute even after a fairly short 
time. However, in such systems, smooth initial probability distributions 
generally relax to a smooth final distribution after some time. There are now 
several “reversible” chaotic maps for which a spectral decomposition can be 
obtained in terms of the decay rates and their associated eigenstates [28, 42, 
43]. The decay rates are related to the Lyopounov exponents for the underlying 
chaos, and determine the physically observable decay properties of such 
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systems. The spectral theory of these systems can be formulated outside of 
Hilbert space. 

Considerable progress has also been made in understanding the emergence 
of irreversible behavior in unstable Hamiltonian systems, at least for the case 
when the dynamical phase space contains dense sets of resonances. For 
such systems a spectral theory can also be formulated outside the Hilbert space 
[45-46]. We don’t have space to say more about this beautiful new area of 
statistical physics, but the cited references should give interested readers a fairly 
readable entrance to the field. Ref. 41 gives a historial overview. 
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PROBLEMS 

Problem 6.1. Consider a system of N uncoupled harmonic oscillators with Hamiltonian, 
H = (pj /2wi + kiqf/ 2). Assume that the system initially has a probability density 

p(p N , q , 0) = n!ii <5(i pi - Pio)S(qi - qio). Compute the probability density p(p N , q N , 0 
at time t, where = (p\ , . . . ,Pn) and q N — (q\ , , q^). 
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Problem 6.2. Consider a particle which bounces vertically in a gravitational field, as 
discussed in Exercise 6.1. Assume an initial probability distribution, p(p,z, 0) = 
^<5(z)0(l.O - p)Q(p - O.1)(0(jc) is the Heaviside function; 0(x) = 1 for x > 0 and 
Q(x) = 0 for x < 0). What is p'(J, 9 , 0)? Sketch p(p, z , t ) and p'(J, 9 , t ) for t = 0.4, mass 
m — 1, and gravitational acceleration g = 1. 


Problem 6.3. Consider a particle with mass m = 1 moving in an infinite square well 
potential, V(x) = 0 for -1 < x < 1 and V(x) = oc otherwise. Assume that initially the 
particle lies at x = — 1 with momentum, p — po for 0. 1 < po < 1 .0 in the positive x 
direction, (a) Find the solution of the Liouville equation in action-angle space at time t. 
(b) At what time does the initial distribution of points begin to break apart in (p,x) 
space? 


Problem 6.4. For a noninteracting gas of N particles in a cubic box of volume V = L 3 , 
where L is the length of the side of box, find the solution, p(p 3A/ , q 3N , t), of the Liouville 
equation at time t, where p 3N = (pi, . . . , p^) and q 3N = (qi , . . . , q^) with 
p j = (pi X ,piy,pi Z ) and q, = (q ix ,q iy ,q iz ). Assume periodic boundary conditions, and 
assume that the probability density at time t — 0 is given by 


p(p 3 "-q 3 ",0) 



for 0 < qi a < L. 


Problem 6.5. Consider a system with one degree of freedom whose dynamics is 
governed by a Hamiltonian of the form H(p,q ) =\p 2 = E, where E is the total 

energy. Assume that initially p(p,q,0) = (l/y^r )8{p)e~ q . Solve the Liouville equation 
for p(p, q, t). (Hint: It is useful to first transform to action-angle variables. The solution 
involves elliptic functions.) 

Problem 6.6. A two-level system has a Hamiltonian matrix 

(H lA H lt2 \ = ( 3 4 i\ 

\h 2 ,x h 2 , 2 ) \ -4* -3 y 

where, for example, H\ 2 = (\\H\2). The density matrix at time t — 0 is 

(pi. i(0) M0)) /I 0\ 

Vph(O) P2.2(0)J \0 0 ;• 

(a) Find the density matrix 

f Pi,i( 0 Ph2(t)\ 

\ P2,l (0 P2,2(t) ) 

at time t. (b) What is the probability to be in the state |1) at time t — 0? At time tl For 
simplicity, assume that h = 1. 

Problem 6.7. An atom with spin 1 has a Hamiltonian H = AS \ + B(S 2 X — S 2 ), where 
S x ,S y , and S z are the x,y, and z components of the spin angular momentum operator. In 
the basis of eigenstates of the operator, S z , these three operators have the matrix 
representations 





and 




1 

0 

-1 



(10 ° 
s z = n o o o 
Vo o -l 
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(a) Write the density matrix (in the basis of eigenstates of S z ) at time t = 0 for two 
different cases: (i) The atom is initially in an eigenstate of S z with eigenvalue +H; (ii) 
the atom is initially in an eigenstate of S x with eigenvalue +h. (b) Compute the density 
matrix (in the basis of eigenstates of S z ) at time t for each of the two cases in (a), (c) 
Compute the average z component of spin at time t for the two cases in (a). 

Problem 6.8. Consider a harmonic oscillator with Hamiltonian H = (l/2m)p 2 + 
ima) 2 * 2 . Assume that at time t = 0 the oscillator is in an eigenstate of the momentum 
operator, p( 0) = \po)(po\. (a) Write the Liouville equation in the momentum basis, (b) 
Compute the density matrix (p'\p(t))\p), at time t. 

Problem S6.1. Locate all period-3 points of the Baker map in the (p , q) plane. 



PART THREE 


EQUILIBRIUM STATISTICAL 
MECHANICS 
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EQUILIBRIUM STATISTICAL 
MECHANICS 


7.A. INTRODUCTION 


Most systems in nature, if they are isolated (i.e., do not exchange energy or 
matter with the outside world), will tend to a time-independent state 
(thermodynamic equilibrium) whose macroscopic properties may be described 
in terms of only a few state variables. For such stationary states the entropy is a 
maximum and the energy is fixed. However, even though the energy is fixed, the 
system could exist in any one of a number of different microscopic states 
consistent with that energy. If we only know the total energy, we have no way of 
distinguishing one microscopic state from another, nor do we have a way of 
assigning probability to those various microstates. As we have seen in Chapter 
6, for the special case of ergodic systems, we can assign a probability to various 
microscopic states based on the mechanical properties of the system. If a system 
is ergodic, it is equally probable to find the state of the system in different 
regions of the energy surface if those regions are equal in size. This leads 
naturally to the following probability distribution on the energy surface: 


p(X N ,E) 


1 


£(£) 

0 


for H(X n ) = E , 
otherwise, 


(7.1) 


where * s the area of the energy surface (the structure function). This 

choice of probability density for an isolated system forms the foundation upon 
which statistical mechanics is built. 

Once the distribution function is given, it is a straightforward matter to 
compute the expectation values of various quantities, such as energy, 
magnetization, and so on. However, there is one quantity which still eludes 
us, and that is the entropy. We know that the entropy must be additive and 
positive and must have a maximum value at equilibrium. An entropy of the 
form S = J dX N [p(X N )] 2n CN, where n is some positive integer and C N is a 
constant, is one type that could be used. However, the form that we shall use in 
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this book is the Gibbs entropy 

S = -k B [ dX N p(X N ) ]n[C«p(X w )], (7.2) 

where k B = 1.38 x 10~ 23 J/K is the Boltzmann constant. The constant, Cn, is 
inserted to give the correct units, but also has important physical meaning. It is 
determined from quantum mechanics. The form of entropy in Eq. (7.2) was 
chosen by Gibbs [1] because it gives the correct expression for the temperature 
of systems which are closed but not isolated. For quantum systems, the Gibbs 
entropy can be written 


S=-k B Tr[p\n{p)\, (7.3) 

where p is the density operator and the trace is taken over any complete 
orthonormal set of basis states. 

As we have seen in Chapter 2, we must have the ability to describe the 
behavior of systems under a variety of external constraints. Closed isolated 
systems have fixed total energy and fixed particle number. Closed systems have 
fixed particle number but varying energy so that only the average energy is 
specified. Open systems can have varying particle number and energy. A closed 
isolated system has an equilibrium probability density (the microcanonical 
ensemble) of the form given in Eq. (7.1). However, the probability densities for 
closed and open systems (the canonical and grand canonical ensembles, 
respectively) will be determined below. Once the equilibrium probability 
density for a system is known, the problem of computing thermodynamic 
quantities is straightforward. 

In this chapter we have selected a variety of fairly simple examples to 
illustrate how the equilibrium probability densities can be applied to real 
systems. We have selected the examples for their historical and practical 
importance or because they illustrate important concepts that we shall need 
later. The models and systems we will study in this chapter are all exactly 
soluble. In subsequent chapters, we will introduce a variety of approximation 
schemes for systems which cannot be solved exactly. 

This chapter is divided more or less into three sections. We begin by deriving 
the thermodynamic properties of two closed isolated systems (fixed energy and 
particle number), namely, an ideal gas and an Einstein solid using the 
microcanonical ensemble. Following a method due to Einstein, we can also use 
the microcanonical ensemble to derive a probability distribution for fluctuations 
of thermodynamic quantities about the equilibrium state. The Einstein method 
is based on a Taylor series expansion of the entropy about absolute equilibrium, 
and therefore it breaks down near the critical point where fluctuations can 
become very large. However, it still gives us valuable insight concerning the 
behavior of fluctuations near a critical point. In this chapter we will apply 
Einstein fluctuation theory to fluid systems. 

We next consider closed systems in which the temperature and particle 
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number is fixed but the energy can fluctuate. Such systems are described by the 
canonical ensemble. As examples of closed systems, we consider in some detail 
the effect of lattice vibrations on the thermodynamic properties of solids. We 
also compute the thermodynamic properties of an Ising spin lattice. We can 
obtain exact expressions for the thermodynamic properties of one-dimensional 
spin lattices, and we use a mean field model to obtain approximate expressions 
for the thermodynamic properties of higher-dimensional spin lattices. Mean 
field theory predicts that a phase transition from a disordered state to an ordered 
state occurs on the spin lattice. 

Finally we consider open systems in which both the energy and particle 
number can fluctuate but the temperature and chemical potential are fixed. Such 
systems are described by the grand canonical ensemble. The grand canonical 
ensemble is especially suitable for describing systems in which phase transitions 
occur which break gauge symmetry because particle number need not be conser- 
ved. We will use the grand canonical ensemble to compute the thermodynamic 
properties of ideal quantum gases, both Bose-Einstein and Fermi-Dirac. 

An ideal Bose-Einstein gas is composed of identical bosons and at very low 
temperatures can exhibit a phase transition (even though the particles do not 
interact) in which a macroscopic number of particles condense into a single 
momentum state merely because of Bose-Einstein statistics. An ideal Fermi- 
Dirac gas on the other hand does not exhibit a phase transition but, because of 
the Pauli exclusion principle, has interesting behavior of its own. No two 
fermions can have the same set of quantum numbers. Therefore, at low 
temperature the fermions begin to fill all the low energy momentum states and 
even at T = 0 K some particles can have a very large momentum. Thus a Fermi 
gas, even at T = 0 K, will have a large pressure. 

For an ideal Fermi gas, there is no condensation into a single momentum 
state. However, if we allow an attraction between fermions, then they can form 
bound pairs which can condense in momentum space. This is what happens to 
electrons in a superconductor. In a superconducting solid, electrons interact 
with lattice phonons and with one another through a phonon-mediated 
interaction which is attractive in the neighborhood of the Fermi surface. The 
lermion pairs condense in momentum space and act coherently, thus giving rise 
to the unusual superconducting properties observed in such systems. 

7.B. THE MICROCANONICAL ENSEMBLE [1-3] 

For an ergodic mechanical system with fixed particle number (closed) and fixed 
energy (isolated), all states on the energy surface are equally probable. This fact 
forms the basis upon which equilibrium statistical mechanics is built and is the 
origin of the microcanonical ensemble. As we shall now show, this distribution 
extremizes the Gibbs entropy and therefore allows us to make contact with 
thermodynamics. Let us first consider a quantum system. As we saw in Chapter 
6, the equilibrium probability density, in order to be a stationary state, must be a 
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function of the Hamiltonian, p = p(H). Let \E,n) denote a set of states of 
energy E with respect to which the density operator is diagonal. The integer, n, 
takes values n = 1, . . . , N(E), where N(E)is the total number of states with 
energy E. The probability to find the system in state | E, n) is P n = ( E , n\p\E, n ), 
and the entropy can be written 

N(E) 

S = -k B Tr[ p ln(p)] = -k B P n In (P n ). (7.4) 

«=I 

We must determine the set of probabilities, {P„}, which extremize the entropy 
subject to the constraint, Tr(p) = p n — 1 • The simplest way to do this is 

to use Lagrange multipliers. Since we have one constraint, we need one 
Lagrange multiplier, which we call ao- We then require the following variation 
to be zero: 


6 



(a 0 P n - k B P n In (/*«)) 


N{E) 

= Y, («o - k B - k„ In (P„))6P„ = 0. (7.5) 

n= 1 


Since the variation, 6P n , is arbitrary we must have ao — k B — k B ln(P„) = 0 or 


P n = e 1 ao//ka = constant. 


(7.6) 


The Lagrange multiplier, ao, is determined by the normalization condition, 
Tr(p) = n=\^ p n = 1- We find that the probability, P n , is given by 


Pn 


1 

W)' 


(7.7) 


Thus, the probability distribution which extremizes the Gibbs entropy is the one 
for which all states of the same energy are equally probable. This is called the 
microcanonical ensemble. If we substitute Eq. (7.7) into (7.4), we find that the 
entropy is given by 

S = k B In (N{E)). (7.8) 

The entropy is proportional to the logarithm of the number of states with energy 

E. 

If we are given a thermodynamic system with a fixed mechanical energy E 
and if we know the number of microscopic states with energy E, then we can 
use Eq. (7.8) to compute the thermodynamic properties of the system. The 
internal energy of the system is just U=E. Therefore, the temperature can be 
found from the thermodynamic relation {dS/dE) N x = 1/T. The generalized 
force is Y = (dE/dX) N s . The chemical potential is p' = (dE/dN) s x . As an 
illustration, in Exercise 7.1, we compute the thermodynamic properties of an 
Einstein solid. 
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■ EXERCISE 7.1. An Einstein solid consists of a lattice in three- 
dimensional space with N lattice sites. Each lattice site contains three 
harmonic oscillators (one for each direction in space), each of which has 
frequency u. Neither the lattice sites nor the harmonic oscillators are coupled 
to one another. The total energy of the lattice is H = hw ^ ^ /i,+ 
| Nhu) = E, where n, is the number of quanta of energy on the ith harmonic 
oscillator. The total number of quanta on the lattice is fixed at M = n i- 
Assume that M and N are very large, (a) What is the total number of 
microscopic states with energy E ? (b) Compute the entropy as a function of 
temperature, T, and number of lattice sites, N. ( c ) Compute the heat capacity 
of this solid. 


Answer: Because the harmonic oscillators are spatially separated on 
different lattice sites and at a given lattice site are associated with different 
spatial directions, they are distinguishable. The quanta are indistinguishable. 

(a) The number of ways to assign M identical quanta to 3 N distinguish- 
able oscillators is 


Number of states with energy E = 


(3A + M — 1)! 
M\(3N — 1)! ‘ 


(i) 


An easy way to see this is to imagine M white dots and 3 N — 1 black 
dots mixed together on a line. The M white dots represent quanta. The 
3 N — 1 black dots represent the boundaries between 3 N “pots” 
(harmonic oscillators) which hold the quanta. The total number of 
different states is simply the number of different permutations of the 
black and white dots. 

(b) The entropy is 


5 = ks In 


/(3 N + M- 1)!\ 
V M\(3N — 1)! /' 


( 2 ) 


For M and N large we can simplify this since by Stirlings formula , 
In (A!) « N In (N) — N. Using this formula we find 


5 = ks In 


m sM+ 3 « 

1+ Wj 

(M/3Nf 


\ 

/ 


( 3 ) 


Now note that 1/7 = ( dS/dE) N = (l/hui)(dS/dM) N and 



?VjJ 

y = In 



(4) 
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Solving for M, we find 


M = 


3 N 

eP** - 1 ’ 


where (3 = 1 /k B T. The entropy is 


S = 3 Nk B ln(l - e~^) + 


3Nk B (3hcue- fjhw 
1 — 


The internal energy is 


U = 


E = 


3 Nhue-P*” 
(1 - eft”) 


+ -Nhu. 


The heat capacity is 


3 NhW e-P RuJ 

hT 2 (1 


(5) 

( 6 ) 

(7) 

( 8 ) 


It is useful to perform a similar calculation for a classical system. Let us 
consider a closed isolated system with a configuration space volume, V, and a 
fixed number of particles, N, which is constrained to the energy shell, 
E — > E 4- A E. We now consider an energy shell rather than just the energy 
surface because it simplifies calculations without changing the physics. We can 
make A E as small as we like. The volume of the energy shell is 
e{E,V,N) = J2{E,V,N)AE, where £)(£, V,N) is the structure function, 
Eq. (6.41) (the area of the energy surface). To obtain the equilibrium probability 
density we must find an extremum of the Gibbs entropy subject to the 
normalization condition 


dX N p{X N ) = 1, (7.9) 

JE<H(E)<E+AE 

where the integration is restricted to the energy shell. We again use the method 
of Lagrange multipliers and write 


6 


dX N (a 0 p{X N ) - k B p(X N ) ln[Cfi/p(X lv )]) 

'e<H(E}<E+A£ 


(7.10) 


= dX N (a o -k B -k B ln[C w >(X*)])<5/>(X ,v ) = 0. 

JE<H{E)<E+AE 

Since Sp(X N ) is arbitrary, we must have p(X N ) = C^ 1 e ao,/kB ~ 1 = constant. 
The Lagrange multiplier, ao, is determined from the normalization condition, 
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Eq. (7.9). We find 


P(X") = 


1 


n AE (E, V , N) 
0 


for E < H( E) <E + AE, 
otherwise. 


If we substitute Eq. (7.11) into Eq. (7.2) for the entropy, we find 


S = k E In 


^ ae(E, V,N) \ 


(7.11) 


(7.12) 


Comparison with Eq. (7.8) shows that fl A E(E, V,N)/Cn is just the number of 
states in the energy shell, E — ► E + A E. The constant, Cm, can be determined 
from quantum mechanics. The volume of a single quantum state in a 6N- 
dimensional phase space is h 3N , where h 6.63 x 10 -34 J S is Planck’s constant. 
Therefore, the constant, Cm, is chosen as follows. For distinguishable particles, 
Cm — h 3N so CIae/Cn is just the number of quantum states confined to the 
energy surface. For indistinguishable particles, Cm = N\h 3N . The factor of /V! is 
included to avoid overcounting of states when the particles are indistinguish- 
able. It is purely quantum mechanical in origin and cannot be justified 
classically except that it solves Gibb’s paradox. In Section 7.D, we shall show 
exactly where it comes from. 

In practice, when computing the thermodynamic properties of a system of 
energy E, it is easiest to work with the total volume of phase space, Q(E, V , N), 
enclosed by the energy surface, rather than the volume of the energy shell. In 
the thermodynamic limit (the limit N —> oo and V — ► oo such that 
N/V = constant), both quantities give the same entropy. To see this, let us 
divide the volume enclosed by the surface of energy E into E/AE shells each of 
width A E. These shells will range in energy from 0 to E. Shells with low energy 
will have a smaller volume than shells with higher energy. The total volume can 
be written as a sum of the volumes of all the shells. 

E/AE 

n (e,v,n) = J 2 nMEi<v,N). (7.13) 

(=1 

The shell with the largest volume will be that with energy E — * E 4- A E. Thus, 
we can write 

fiA e(E, V, N) n (E, V,N)^-E n^E, V, N). (7. 14) 

If we take the logarithm of each term in Eq. (7.14), we find 
ln(^A£) < In (O) < \ji(Qae) - In 



(7.15) 
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But ln(Q)~A and ln(£/AE) ~ ln(N). Therefore, in the limit N — > oo, 
ln(O) = ln(QA£), because we can neglect terms of order In (N) relative to 
terms of order N. Thus, in the thermodynamic limit the entropy can be written 


S = ks In 


^(E, V,N) ^ 


(7.16) 


where C^ = h 3N for distinguishable particles and Cn = N\h 3N for indis- 
tinguishable particles. In Exercise 7.2, we compute the thermodynamic 
properties of an ideal classical gas using Eq. (7.16) and the phase space 
volume, fi(£, V,N), computed in Exercise 6.2. 


■ EXERCISE 7.2. Use the microcanonical ensemble to find the entropy 
and equation of state of an ideal gas of N identical particles of mass m 
confined to a box of volume V Assume that N and V are very large. 

Answer: From Exercise 6.2, the volume of phase space with energy less than 
E is 


ow-ggrr. 

v ' T(3A/2-fl) 

For very large N and V, the entropy is given by 

c _. , /0(E) A , . ( V N (2irmE} mp 
B \N\h M ) " n ( N\h™r{3N/2 + 1) 


(I) 


( 2 ) 


Let us note that for large N, we have ln(A!) » N \n(N) — N and 
T(3A/2 + 1) = (3N/2)\ « (3N/2e) 3N / 2 . We can rewrite the entropy in the 
form 




2 

5 Nk B 


4 -Nks In 


v '(w) /2 j + Nk ‘- Nk sHN) 

V f 4nmE \ 3//2 
N \3h 2 NJ 


( 3 ) 


In the middle term we have explicitly separated off the contribution 
(-1 -Nks — Nks In (A)) coming from the Gibbs counting factor. We see that 
without it, the entropy is not a homogeneous function. In the last term, the 
Gibbs counting factor is combined into the rest of the terms, and we see that 
the entropy is just the expected Sackur-Tetrode entropy and is a 
homogeneous function of the extensive variables. 

If we remember that the internal energy, U , equals E, then Eq. (3) is the 
fundamental equation for the entropy. The pressure is given by 
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P = —(dU/dV) SN . If we take the derivative of Eq. (3) with respect to V 1 
holding 5 and N fixed, we obtain an equation for P = -(dU /dV) S N which 
we can solve to give P = 2U/3V. Next note that T = (dU/dS) V N . If we ; 
take the derivative of U with respect to S holding V and N fixed, we obtain an 
equation for ( dU/dS) v N which we can solve to yield U = \Nk B T. If we 
combine these two results, we obtain the ideal gas equation of state 
PV = Nk B T. j 


7.C. EINSTEIN FLUCTUATION THEORY [4, 5] 

There is a very simple method due to Einstein [5] for obtaining the probability 
distribution for fluctuations about the equilibrium state. This theory uses the 
microcanonical ensemble. We first derive the general formalism and then apply 
it to thermodynamic systems 

7.C.I. General Discussion 

Let us consider a closed adiabatically isolated system with energy in the range 
+ A E. We shall assume that the system is ergodic so that all possible 
microscopic states of the system are equally probable. Let r(E) denote the 
number of microscopic states with energy E — ► E + A E. Then the entropy of 
the system is given by 


S = k B ln[r(£)]. (7.17) 

Often it is useful to have a more detailed microscopic description of the system 
than that given by Eq. (7.17). For example, we might want to subdivide a 
system into cells and to specify its state according the thermodynamic behavior 
of each cell (each cell would be an open system). We can do this if we assume 
that, in addition to the energy, the macroscopic state of the system is describable 
in terms of n independent macroscopically measurable parameters (state 
variables) A,(i = 1, 2, . . . , n), where A, could denote quantities such as energy, 
pressure, temperature, and so on of the ith cell. 

Let r(E,Ai, . . . , A n ) denote the number of microstates with energy E and 
parameters A\,...,A n . Then the probability that the system is in a macroscopic 
state described by parameters E, A\, . . . ,A n , is given by 

P(E,A 1 ,...,A„)= r(£ ’^' (£ y’ A " ) . (7.18) 

The entropy of the system in a state with parameters (E,Ai, . . . ,A„) is given by 
S(E, Aj, .. . ,A„) = k B ln[r(E, A,,... ,/!„)]. (7.19) 
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Hence 


P(E,A U ...,A.) 


m exp 


^S(E,A U ... 

KB 



(7.20) 


The entropy will be a maximum when the system is in an equilibrium sate, 
A®, . . . , Aj. Any fluctuation about the equilibrium state must cause the entropy 
to decrease. If we let a, denote the fluctuations 


cc l = A i -A° i , (7.21) 

then we can expand the entropy about its equilibrium value to obtain 

S(E, A u ...,A n )= S(E, Al...X)-li2J2«‘i a ‘ a i + ---’ ( 7 - 22 ) 


1=1 j= 1 


where 


8ij = 


\dAidAjJ Al , A o ;A/=A o 


(7.23) 


Einstein fluctuation theory is based on the assumption that fluctuations about 
the equilibrium state are small. Therefore, we can terminate the expansion of 
the entropy at second order. This approximation is not good near a phase 
transition where fluctuations can become very large. The matrix g is positive 
definite since the entropy must decrease and is symmetric since the quantities, 
A,-, are state variables. (A matrix is positive definite if every principal minor is 
positive.) Equation (7.22) contains no terms which are first order in A, to ensure 
that spontaneous fluctuations about the equilibrium do not cause a decrease in 
the entropy. Equilibrium is a state of maximum entropy. 

We can now substitute Eq. (7.22) into Eq. (7.20) and obtain the following 
expression for the probability distribution of fluctuations about the equilibrium 
state (cf. Exercise 4.9): 


P{ «) = 


detjgj 


(27i -k B ) 


n exp 


1 T - 

— * ga 

2 k B 


I det|g| 
(2irk B ) 


« exp 




(7.24) 


The vector ct T is the row vector x T = (a\ ,...,a„) and is the transpose of the 
column vector a. The quantity gy is the ijth element of the n x n matrix, g. The 
quantity det Igl is the determinant of the matrix g. The probability P( a) is 
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normalized to one: 


* oc oc 

DaP(a) = da\ x • • • x 

J — oc — 00 


* 00 

da n P{ a) = 1 

J —oo 


(7.25) 


Since only small fluctuations are assumed to be probable, there is no difficulty 
in extending the limits of integration in Eq. (7.25) from — oo to +oo. 

We often want to find expectation values of various moments of the 
fluctuations. To do this, it is convenient to introduce a more general integral, 


7(h) = 


' det 1 g 

(27 rk B ) n J 


Da exp 


g a + h 

2 KB 


= e : 


(7.26) 


where 7(h) is a characteristic function (cf. Exercise 4.9). The moment, (a, a 7 ), 
is defined as 


<“^ = ll m o(aSi; /(h) ) = ' e(r,) "- (7 ' 27) 

Since the probability density, P( a), is a multivariant Gaussian with zero mean, 
the first moment, (a,) = 0, and all higher moments can be expressed in terms of 
the moments <«?> and (a, -a/) (cf. Exercise 4.9). 


7.C.2. Fluid Systems 

Let us consider a closed isolated box with volume, VV, which contains a fluid 
with total entropy, Sj, total energy, E T , and total number of particles, N T , and 
total mass, M T . We shall assume that the box is divided into v cells of equal size 
and that at equilibrium the volume, entropy, internal energy, and mass of 
particles in the 7th cell are V/ = Vo, S,-, Ui , and M,, respectively (cf. Fig. 7.1). 
The total volume of the box is then VY = uVq. The equilibrium pressure, 
temperature, and chemical potential of the whole system are denoted Po, To, and 
Ao, respectively. 

From Eq. (2.171), the entropy change due to deviations in various 
thermodynamic quantities from their equilibrium state will be given by 

A St = V[-AT,AS, + AP t AVi - AfcAM,]. (7.28) 

If we now use the expressions obtained in Section 7.C.1, we can write an 
expression for the probability of a given set of fluctuations. To obtain an explicit 
expression, we must choose a particular set of independent variables. 
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Fig. 7.1. A system with fixed total energy, Et, total volume, Vt, total mass, Mr, 
equilibrium pressure, Po, equilibrium temperature, To, and equilibrium chemical 
potential, po- To describe local fluctuations about absolute equilibrium, we divide the 
system into cells. Cell i has volume, entropy, internal energy, and mass given by 
Vi = Vo, Si, Ui, and Af,, respectively. 


Schematically, we can write the probability as 

P(A5r) = - APjAVi + A/jjAM,] . 

(7.29) 

where the matrix g is determined once the independent variables are decided 
upon. 

Let us first pull the volume dependence out of Eq. (7.28). We will write 
S = Vs and M = V p, where s is entropy/volume and p is mass/volume. Let us 
next note that the Gibbs-Duhem equation, when written in terms of mass 
density, p, takes the form pdp = —sdT + dP. Therefore, ATAS — APAW 
ApAM = V(ATAs + ApAp). The differential of the Helmholtz free energy/ 
volume, a, satisfies the equation da = —sdT + pdp, so ( dp/dT) p — —{ds/dp) T - 
If we now choose temperature, T, and mass density, p, as independent variables, 
we find 


ATAS - APAV + ApAM = V 


ry 


+ 



(7.30) 


where c p is the equilibrium specific heat. The row vector, a r , can be written 
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a = (F f , . . . , Y p g , . . . , p v ) and the matrix, g, has matrix elements 



if i = j and 1 ^ i ^ i/; 

if i = j and v 4- 1 ^ i < 2u; 
otherwise. 


(7.31) 


The probability of a fluctuation, <x T = (ATi, . . . , A T y , Api, . . . , A p v ), is 
therefore 


re AT AoM ^ (KDr) 

n{ATh Aa}) = \ 

1 ^ [Vc p , Arr , 2 V / 

x «p[-25§lrS- (AI » +?( 

D <«1 

(7.32) 

We can now obtain expressions for variances and for correlations between 
fluctuations in various cells. It is easy to see that 

POO POO 

(ApjATk) = • • • d(Api ) . . . d{Ap„)d(AT x ) . 

J — (X) — oc 

...d(AT„) 

x P({ATi, Api}) Apj ATt = 0, 

( 7 . 33 ) 


(7.34) 


(7.35) 

and 


(A Pi ) = <A Tj) = 0. 

(7.36) 


Thus, we have found that fluctuations in temperature and density are 
statistically independent. One can also show that pressure and entropy 
fluctuations are statistically independent. However, fluctuations in most other 
pairs of thermodynamic variables are not independent. 

It is important to note that in Eq. (7.33) we found no correlation between 
various cells. This result was built into the theory because Eq. (7.33) contains 
no information about coupling between cells. In real systems, there is coupling 
between cells. This can be included in the theory by expressing AS, and AP, in 
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terms of temperature and volume variations in other cells and not just those of 
cell i. The more general expression will then contain coupling constants which 
reflect the strength of the coupling between the cells. 

In this section, we have analyzed fluid systems by dividing them into discrete 
cells. This, of course, is a rather artificial way to proceed, but it is conceptually 
very simple and gives us good intuition about what thermodynamic quantities 
govern the behavior of fluctuations about the equilibrium state. It is a simple 
matter to change the summations over discrete cells to integrations over 
continuously varying densities, provided that the spatial variations have 
sufficiently long wavelengths (vary slow enough). We shall look at the spatial 
dependence of fluctuations in later chapters. 


7.D. THE CANONICAL ENSEMBLE 

A closed system can exchange heat with its surroundings and as a consequence 
will have a fluctuating total energy. We therefore need to find a probability 
density which corresponds to an extremum of the entropy closed systems. This 
probability distribution is the basis of the canonical ensemble. In Section 7.D.1 
we derive the probability density operator for a closed equilibrium system, and 
in Sections 7.D.2 and 7.D.3 we show how to compute thermodynamic 
properties for systems of indistinguishable particles and for systems of 
distinguishable particles, respectively. 

7.D.I. Probability Density Operator 

In order to obtain the probability density operator for a closed system we must 
extremize the Gibbs entropy subject to two constraints. We require that the 
probability density operator, p, be normalized. 

Tr N (p) = 1, (7.37) 

and we require that the average energy be fixed to some value ( E ), 

T r N (H N p) = {E). (7.38) 

If we introduce the Lagrange multipliers, ao and a E , we can find the probability 
density operator, p, which extremizes the entropy subject to the constraints, 
Eqs. (7.37) and (7.38). The extremization condition is 

<5{Tr;y[a!oP + oz E H N p - k B p ln(p)]} 

= Tuv[{(ao -k B )I + a E H N ~ k B \n(p)}6p] = 0, 

where 7 is the unit operator. Since 6p is arbitrary, we must have 

(ao — k B )I + oceHn — k B ln(p) = 0, 


(7.40) 
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and therefore 


p = exp 


Qq _ j 

k-B 


kB 


(7.41) 


The two Lagrange multipliers, olq and as, can be determined from the 
constraints Eqs. (7.37) and (7.38). From Eqs. (7.37) and (7.41) we find 

Z N (oc E ) = exp(l - (7.42) 

The quantity Z^(T) is called the partition function. We next determine the 
Lagrange multiplier, <*£. Let us multiply Eq. (7.40) by p and take the trace. We 
obtain 

Tr/v[(a;o — ks)p 4- a^H/vp — ksp ln(p)] = — kj? ln[Zw(a!£)] + oce(E) + S = 0, 

(7.43) 

where we have made use of the constraints, Eqs. (7.37) and (7.38), and the 
definitions of the partition function Eq. (7.42) and the Gibbs entropy Eq. (7.3). 
From Section 2.F.3 the fundamental equation for the Helmholtz free energy can 
be written A - U + ST = 0. If we note that the internal energy is U = (E), then 
we can make the identification ole — — 1 /T and we find that the Helmholtz free 
energy is 


A = -k B T In [Z n {T)\. 

The partition function, Zn(T), takes the form 

Z N (T) = e~» A = Tr„(<T' ,fe '), 


(7.44) 


(7.45) 


where j3 = \/ksT. The probability density operator can be written 


p = e - 0 ^ n ~ a) 


e -m N 

Tx N {e-^) ' 


(7.46) 


Equation (7.46) is the probability density for the canonical ensemble. The trace 
is evaluated using any convenient complete orthonormal set of Wparticle basis 
states. However, as we shall show below, we must make a distinction between 
systems of N distinguishable particles and systems of N indistinguishable 
particles. 

It is important to note that Eq. (7.46) was obtained by extremizing the Gibbs 
entropy. Historically, the Gibbs entropy was chosen because it gave this result 
[1, 2] for the probability density of a system at temperature T. At high 
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temperature, where equipartition of energy occurs, Eq. (7.46) gives the correct 
relation between the temperature and the average energy. 

Equation (7.44) is the Fundamental Equation for a closed system. From it we 
can obtain all thermodynamic quantities. For example, the entropy is given by 
S = -(dA/dT)x The generalized force is given by Y — {dA/dX) T N . The 
chemical potential is given by // = (dA/dN) T x . Another useful relation for the 
internal energy is U = ( d/3A/d(3) x N . 

In the canonical ensemble the temperature, T, is fixed and the average 
energy, U — (E), is fixed. However, because there can be a flow of energy in 
and out of this system, it is important to know how large the fluctuations in the 
energy about the average, (E ) , will be. Let us therefore compute the variance of 
the energy fluctuations. We first write the normalization condition 

Tr„(^(T^)-"»]) = l, (7.47) 


If we differentiate Eq. (7.47) twice with respect to (3 , we find 




|74 « 


This gives 


{El) -^=-(cw) x = kBT2Cx ’ N ' 


(7.49) 


where C x ,n is the heat capacity at constant X and N. The heat capacity, CV,/v, 
and average energy, (E), are each proportional to N. Therefore, the fractional 
deviation behaves as 


\/<£ 2 }-(£> 2 

(E) 


AT 1 / 2 


(7.50) 


and goes to zero as the number of degrees of freedom becomes infinite. This 
means that the fluctuations in energy become very small relative to the 
magnitude of the energy itself. In the thermodynamic limit, most microstates 
will have an energy approximately equal to the average energy, U = (E), and 
the canonical ensemble becomes equivalent to the microcanonical ensemble. 

In evaluating the trace in Eq. (7.45) we must distinguish between systems of 
indistinguishable particles and systems of distinguishable particles. A system of 
N indistinguishable particles, by definition, has the property that the 
Hamiltonian and all other physical observable remain invariant under permu- 
tation of the particles. We will consider both cases below. 
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7.D.2. Systems of Indistinguishable Particles 

As we show in Appendix B, for systems of identical particles we can evaluate 
the trace either in terms of complete sets of symmetrized or antisymmetrized N- 
body position or momentum eigenstates, or in terms of the “number” 
representation. However, the fact that in the canonical ensemble the number of 
particles is restricted to N makes the particle number representation unwieldy 
when using the canonical ensemble. As we have shown in Appendix B [Eqs. 
(B.30) and (B.32)], the trace of the probability density for a set of N identical 
particles may be written 

Tcn(p) = J7\ (k>---Mp\k u ..., k N ) (±) = 1, (7.51) 

^’* 1 , 

where the states |fci , . . . , k N ) denote “momentum” eigenstates for the N- body 
system. The symmetrized state \k\, . . . must be used for bosons. The 

antisymmetrized state \k\, . . . ,k^)^ must be used for fermions. The 
“momentum” eigenstates may be written as the direct product of the 
“momentum” eigenstates of the constituent particles, 


I k a ,k b , ...,ki) = \k a ) x \k b ) 2 x • • • x jfc/) 


N- 


(7.52) 


On the left-hand side of Eq. (7.52), we use the convention that the “momenta” 
of the particles labelled from 1 to N are ordered from left to right from 1 
to N. Thus, particle 1 has “momentum” k a , particle 2 has “momentum” 
k b ,. . . , particle N has “momentum” fy. (When we say “momentum” we 
include spin and any other internal degrees of freedom of a particle that 
might affect the thermodynamic properties of the system.) The state | kn) x is 
the “momentum” eigenstate for particle 1. The state k N ) {±) is 

defined as 


i*i = £(±l)'>|* 1 , (7.53) 

P 

where ^2 P P denotes the sum over all permutations of the momentum in 
the state |&i,...,fyv) (cf. Appendix B). In Exercise 7.3 we evaluate the 
partition function for the (unrealistic) case of a three-body system of identical 
particles. 


S ■ EXERCISE 7.3. Compute the partition function, Z^iT ), for an ideal gas 
of three identical particles (bosons or fermions) in a cubic box of volume 
V = L\ Assume the walls of the box are infinitely hard. For symplicity, 
neglect any spin or other internal degrees of freedom. What approximations 
| can be made for a high-temperature and low-density gas? 
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Answer: The Hamiltonian for this three-body ideal gas is # 3 = p\/2m+ 
P 2 / 2 m + pl/2m, where p, = /ik; is the momentum operator for the ith 
particle and k,- is the wave vector of the ith particle. 

It is instructive first to work out the partition function for a single particle, 
Zi(T) = Tr 1 [exp(-/3p 2 /2m)] = Y,^\ ex P{-0^ 2 /2m)\]s), where |k) is 
orthonormal eigenstate of the wavevector operator, k = p /h and 
( k | k') =S k y. Since the particle is confined to a cubic box of volume 
V = 1} and the walls of the box are infinitely hard, the kinetic energy 
eigenstates can be written 

^n x ,n y ,n z (x, y, z) = A sin sin i~~-) sin , (1 ) 

where n x ,n y , and n z are integers 1,2, ...,oo, and A is a normalization 
constant. The allowed wavevectors are 


. n x 7r . n v 7r. n 7 n, 

k =x ,+ ±> + ± k ' 


( 2 ) 


where i, j, and k are unit vectors in the x,y, and z directions, respectively. 
The single particle partition function can be written 


00 00 00 

«*= 1 n y = 1 n z =l 


' pTr 2 # 
2mL 2 



( 3 ) 


If the volume, V, is very large on microscopic scales, we can change the 
summation to an integration in Eq. (3). Let us note that 5^*=i = Ji°° dn x , and 
make a change of variables, p x = n x 7rh/L. For very large L, Jm/L « 0 and 
we find 


Z,(7-) = 


7 r 3 h 3 

V_ 

V 


dp x 


dp x 


dp y 


dPy 


P OO 


(3 


o dp z exp | — — {p 2 x + Py + p 2 z ) 


dp z exp 


-ti {p2 ^ p2 y +p " ] 


\ y 

( 4 ) 


where 


\/27rmkBT 

is the thermal wavelength and has units of length. As we shall show below, 
the thermal wavelength is a measure of the distance over which the particles 
maintain phase coherence in the gas. 
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The three-body partition function can be written 

Zj(r) = T < k >' k *> k 3- W k '- k 2> k 3) (±) - ( 6 ) 


k],k 2 ,k 3 


where 


|ki, k 2 , k 3 ) (±) = jki, k 2 , k 3 ) ± |ki, k 3 , k 2 ) ± |k 2 , ki, k 3 ) 

± |k 3 , k 2 , ki) + |k 3 , ki, k 2 ) + |k 2 , k 3 , ki). 


(7) 


If we note the orthonormality condition, (k a ,kfe,k c | k^,k e ,k/) = 
^kA^kA*/ then Eq. (6) becomes 


z 3 (r )=5? 

_ 1 

“ 3! 


{Z X {T)Y ± 3Zi(T)Zi ( - ) + 2Zi [ - 


^ ± 


£>+• 2 


4J (2f 2 (3f 2 \X T ) 


( 8 ) 


3! V A 3 , 


1 ± 


At 


+ 


(2f 2 \Vj " (3 f 2 \V 


The semiclassical limit corresponds to high temperatures and/or low 
densities (large V). In that case, we see from Eq. (8) that we can neglect 
terms proportional to \\/V and we obtain the semiclassical partition 
function for a gas of three identical particles: 


z ^4(S>- 


(9) 


For a gas of N identical particles, the semiclassical limit gives an 
approximate partition function: 


i / v\ 




( 10 ) 


The factor N! in Eq. (10) is exactly the counting factor used in the 
! microcanonical ensemble for indistinguishable particles. It resolves the 
j Gibbs paradox. 


As we have shown in Exercise 7.3, symmetrized and antisymmetrized states 
must be used when computing the partition function for systems of identical 
particles at low temperatures and/or high densities where quantum effects 
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strongly influence the translational degrees of freedom. However, for 
moderately high temperatures and/or moderate densities the partition function 
takes the approximate form 

Z N (T) Y, <*!.••• (7.54) 

' k u ...,k N 

where we now neglect the contributions from permuted states. In doing so, we 
are neglecting terms of order, (N/V) A^, where \t = hf \f2irmk B T is the 
thermal wavelength. Equation (7.54) gives a good description of a semiclassical 
gas of N identical particles. Quantum effects still must be taken into account in 
computing any contributions to the partition function due to internal degrees of 
freedom. To see this, let us consider a gas of noninteracting molecules which 
have internal degrees of freedom which can absorb energy. Such internal 
degrees of freedom might include the rotational and vibrational motions of the 
molecule and electronic or nuclear excitations. For such a system, the 
Hamiltonian of the ith molecule can be written H, = p?/2m + H i [ rot )+ 
Hi(vib) kli(eiec) T ^i(nuc) T ■ ■ ■ , where > kli( e i ec ) , zwid denote 

the rotational, vibrational, electronic, and nuclear internal degrees of freedom, 
respectively. In writing this Hamiltonian as we have, we have assumed that 
these various internal degrees of freedom are uncoupled from one another. For a 
gas of N uncoupled particles in a box of volume V, the partition function can be 
written 


Zv(F,r)=-Tr N 




The partition function takes a particularly simple form for this system if the 
Hamiltonians in Eq. (7.55) commute with one another. Then we find 


Zn{T , V) = Jyi (Z Htr) Zi ( rot ) Zivib Z\ (ei ec } Zi(„ MC ) X • • • ) N (7.56) 

where Z l(tr) = Tr i(e~^ 2m ),Z 1(rot) = Tn{e~ 0fl ^ rot) ), and so on. The trace, Tr, 
is taken over a complete set of single particle states appropriate for the 
Hamiltonian appearing under it. 

For a semiclassical ideal gas with no internal degrees of freedom, the 
partition function is (cf. Exercise 7.3) 

z * (r ’ v) =^(£fK3" (7 - 57) 

where we have made use of Stirling’s approximation, N\ fa (N / e) N . The Helmholtz 
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A = 


-k B T\n(Z N ) = —Nk B T - Nk B T\n 


t g y 3/2 l 

\27r mk B T ) I 
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(7.58) 


The entropy is 


S 



V,N 


= -Nk B +Nk B In 


/ h 1 \~ 3/2 
\27r mk B T ) 


(7.59) 


which is just the Sackur-Tetrode equation. In Exercise 7.4 we compute the 
thermodynamic properties for a semiclassical gas of N identical spin-^ particles 
in a magnetic field. 


j ■ EXERCISE 7.4. A cubic box (with infinitely hard walls) of volume, 
j V — l? , contains an ideal gas of N identical atoms, each of which has spin, 
t 5 = 5 , and magnetic moment, fi. A magnetic field, B, is applied to the 
| system, (a) Compute the partition function for this system, (b) Compute the 
j internal energy and the heat capacity, (c) What is the magnetiziation? 

| Answer: 

(a) The partition function takes the form 

z N =±(z lM f { z Hmag) y, (i) 

| where is the translational contribution for a single molecule and 

I Zj (mag) is the magnetic contribution for a single molecule. The single 

| particle translati onal parti tion function can be written Zi ( ,,) = V/A|, 

! where \j = h/ \f'hrmk B T is the thermal wavelength. 

| Let us now compute the partition function due to the magnetic 

I degrees of freedom. Each atom will have magnetic energy 

[ E(s) = — where s — ±1. The magnetic partition function for 

j a single atom is 

| z llmag) = £ = 2cosh(^) . (2) j 

! i=±i V L / 


The partition function for the gas is 


Zn 


_1_ 

A! 



N 

cosh^ 



(3) 
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(b) The internal energy is given by 

(d\nZ N \ 3 

V = ~\d0~ ) = 2 f 


= ^Nk B T -~N/iBtanhfl^ 


The heat capacity is 


Cv,N ~ 


= -Nk B +Nk B 


(c) The magnetization is given by M = -(d$/dB) T N ; where $ is the 
free energy for this problem (the translational part is like a Helmholtz 
free energy, a function of 7\ V, and N, and the magnetic part is like a 
Gibbs free energy, a function of T, B, and N). The free energy of the 
combined system doesn’t have a name so we call it 4>. Then 
= — K b T\ti(Zn) and 


M — — 


= '-Nv tanh(^ 


7.D.3. Systems of Distinguishable Particles 

For systems of distinguishable particles, the trace in Eq. (7.45) can be evaluated 
using a complete set of unsymmetrized /V-body states. For example, if we use 
“momentum” eigenstates to evaluate the trace, we have 

Tr N (p)= Yl (7-60) 

ki,...,k N 

One of the best examples of such a system is a semiclassical solid composed of 
N lattice sites. Even if the atoms at each lattice site are identical they do not 
have translational kinetic energy. They are located at different points in space 
and they can be distinguished by their position in space. An Einstein solid is an 
example of a semiclassical solid in which the atoms can be considered to be 
distinguishable because they remain at well-defined lattice sites. In Exercise 7.5 
we compute the thermodynamic properties of an Einstein solid using the 
canonical ensemble. 


■ EXERCISE 7.5 Use the canonical ensemble to compute the entropy, 
internal energy, and heat capacity of the Einstein solid described in 
Exercise 7.1. 



THE CANONICAL ENSEMBLE 


363 


Answer: The Hamiltonian for an Einstein solid with N lattice sites and 3N 
harmonic oscillators with frequency uj is 



(i) 


where n, is the number operator for energy quanta in the ith harmonic 
oscillator. The number operators have eigenvalues which are integers, 
n, = 0, 1,2, . . . , oo, and eigenvectors |n t ), so that n,|n,) = n,-|n,-). The parti- 
tion function is given by 


Zn{T) = Tr^ 


3 N 


exp — / 3hu ^ (hi + 


/=! 


( 2 ) 


We can evaluate the trace in Eq. (2) using the basis states |«i,n 2 ,..., 
« 3 v> = |«i> x • • • x |n 3A r). Then 


3 N 

z n(T) = Yl " ,n 3 Ar|exp( -/3/iu; (n,- + ±) j |ni, . . . ,n 3N ) 

n\— 0 n 3N —0 \ i=l 

oo oo / 3 N 

= • ■ * X! ex p +5) 


n t =0 n 3\=0 

00 


/— 1 


HE e ~^(ni+i) X • • • X E 


7 -0hu(n iN +\) | _ 




S,«3Af=0 


,-0tuv/ 2 \ 3N 


1 _ e -0Hu 


(3) 

| (Note that the number of particles is fixed at N. The number operator, hi 
! counts the number of vibrational quanta in the ith harmonic oscillator. Even 
though the number of particles is fixed at N, the number of vibrational 
I quanta has no restriction.) 

| The Helmholtz free energy is 

j A(T,N ) = -k B T\n(Z N (T)) = l Nhu + 3 Nk B T \n(l - e~^). (4) 


The entropy is 

s= ~(§f)„ = 3Nke ln(1 ~ e ' c "“ ) + 

The internal energy is 
U = 


3 Nhu) e~^ 

T (\-e'^)‘ 


,—phu> 


- = - Nhu -{- 3 Nhuj — 

d(3 ) N 2 


(5) 


( 6 ) 
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The heat capacity is 


Cki = 


fdU\ _ 3AftV e~^ 

\df) N ~ ksT 2 ( i- e 


( 7 ) 


7.E. HEAT CAPACITY OF A DEBYE SOLID [6, 7] 

The intermolecular cohesive forces which bind the simplest molecular crystals 
are the same as the forces which give the correction terms in the van der Waals 
equation of state (Section 2.C.3). The short-range repulsive forces between 
atoms make crystals virtually incompressible. The longer-range attractive force 
holds the atoms of the crystal in place at some fixed lattice spacing, although 
the atoms can undergo oscillations about their lattice positions. To first 
approximation, a crystal may be viewed as an ordered array of atoms, each one 
fixed to a lattice site and free to oscillate about the lattice site. While the 
potential which governs the oscillations is anharmonic, to lowest approxima- 
tion, in deriving the heat capacity, we can treat it as harmonic. In this section 
we shall compute the heat capacity for a three-dimensional harmonically 
coupled lattice using a continuum approximation of the type first proposed 
by Debye and we will compare it to experiment. (In Section S7.A we obtain 
an exact expression for the heat capacity of lattice vibrations for a one- 
dimensional lattice and compare Debye theory to the exact result for that 
case.) 

Let us consider the contribution to the heat capacity of the sound modes in a 
simple three-dimensional harmonic lattice. Consider a rectangular lattice with 
sides of lengths L x , Ly, and L z . We will assume that all sides are pinned so that 
the sound waves form standing waves with wavelengths 2L x /l x ,2Ly/l y , and 
2 L z /l z in the x,y, and z directions, respectively, where l x , l y , and l z are integers. 
The a* sound mode will have a dispersion relation of the form 



The Hamiltonian operator takes the form 

N IV 

H = hcu a (a j a a a 4- 5I) = ^2 huj a (n a + \ 1), (7-62) 

q =1 a= 1 

where h a = a f a a Q is the number operator for energy quanta (phonons) in the a th 
normal mode. Let \n a ) be the eigenvector of the number operator, h a . Then 
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n Q |n Q ) = n a \n a ). The partition function can be written 


Zn(T) — Tr^v 


exp ( -(3 ^2 huJ a {n a + 


a= 1 


(3hu a 


N —/3fao a /2 

=n T 


a= 1 


g—f3hu) a 


(7.63) 


= n( 2S ' nh V 2 


The average energy is 

d\n(Z N ) ^ huo a / 2 


(E) = ~ 


d(3 


=£ 


Q=1 tanh 


(3hu a 


E hu a \ — \ hujfy 


, 2 ' - 1 ' 

Ct— 1 Q=1 


(7.64) 


Since the average can also be written ( E ) — (5^a=i ^a(«a + 1)), comparison 
with Eq. (7.64) shows that the average number of quanta in the phonon 
mode is 


{h a ) = 


1 


e dhu a _ J ’ 


(7.65) 


which is Planck’s formula. 

The allowed values of u n in Eq. (7.61) consist of discrete points in a three- 
dimensional frequency space. The distance between points in the x direction is 
nc/L x , in the y direction it is nc/L y , and in the z direction it is 7 rc/L z . The 
volume per point in frequency space is therefore (nc) 3 /V, where V = L x LyL z is 
the volume of the crystal. The number points per unit volume in frequency 
space is V/{n c) 3 . The total number of allowed values of u a less than some 
value uj is given by i (47ru; 3 /3)(F/(7rc) 3 ), where | (47ru; 3 /3) is the volume of | 
of a sphere of radius u (the phonon frequency is positive). Thus, the number of 
allowed frequencies in the range u — ► u + du is given by 

dl = 2^? u2dUJ ' (7 ’ 66) 

In general, there will be two transverse sound modes and one longitudinal sound 
mode since crystals can sustain both longitudinal and transverse sound modes (a 
fluid can only sustain longitudinal sound modes). The transverse and 
longitudinal sound modes in a crystal propagate at different velocities which 
we shall denote as c t and c/, respectively. If the three different sound modes are 
taken into account the number of allowed frequencies, dn, in the interval, 
to — ► u + duo , is 


dn = 


V 

2n 2 




up - duo. 


(7.67) 
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Since there is a minimum wavelength allowed on the lattice due to the finite 
spacing of the atoms, there will be a cutoff frequency, u D (the Debye 
frequency). If there are N atoms on the three-dimensional crystal, there will be 
3N sound modes. We can then determine the Debye frequency by relating it to 
the total number of modes, 


3 N 


f 3 * J V (2 1\ 

.1 H 2tt 2 U 3 + cj) 


ru)D 2j V (2 l\w 3 


(7.68) 


If we solve for cu 3 D , we find that the Debye frequency is given by 


LOrt — 


18JVtt 2 /2 1 




vi 


-i 


(7.69) 


The density of states can be written 


*W=r.= 


dn Vuj 2 (2 1 


( 5 I T I — 


9 Nu 2 


(7.70) 


du) 2 tt 2 \c 3 ' c 3 y 
Once we are given the density of states, g(co), the average energy is given by 


<*>“2 


Up 


( UJ L 

hum((jj)g{u)du. (7.71) 

o 


We then find 


(E) = 


9Nhu D 9 N f WD hu 3 duj 


8 


+ 


9 N p 

Jo 


e 0fw _ i ' 


(7.72) 


The heat capacity is 


Cm = 


9 Nk B f WD J (/ 3hu) 4 e 9 Nk B [ Td/t , x 4 ^ 


h 2 P> 2 u . ) 3 d io [eP 1 ™ — l ) 2 (HPivd ) 3 Jo (e* — l ) 2 ’ 


dco 


dx 


(7.73) 


where the Debye temperature, T D , is defined T D = hu D /k B . In the limit T — > 0, 
the heat capacity becomes approximately 


Cm 


9 NIcb 

(h0u D y 


f 


dx- 


x 4 e x 


At 3 


(^-i Y 


12NJc B 7t 4 T 

5 T 3 


(7.74) 


These results are in good qualitative agreement with the experimentally 
observed heat capacity of many solids. We give two examples in Fig. 7.2 
where we have plotted the prediction of Debye theory and have compared 
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Qv. 


3 R 



Fig. 7.2. The solid line is the heat capacity curve predicted by Debye theory. The 
dashed line is that predicted for Einstein solid. The circles and triangles are 
experimental values for the heat capacity of A1 (T D = 390 K) and Cu (T D = 315 K), 
respectively. (Based on Ref. 8.) 


it to experimental results for aluminium and copper (using the appropriate 
Debye temperatures for those substances). Experimentally, one finds that at 
high temperatures the heat capacity, C N , is roughly constant with C N « 
3 Nk B = 3 nR, where n is the number of moles and R is the gas constant. 
However, as the temperature approaches T =0 K, the heat capacity goes to zero 
as T 3 . 

It is interesting that classical physics cannot give the correct behavior for the 
heat capacity of solids at low temperature. A classical harmonically coupled 
lattice with N atoms has a heat capacity, Cm = 3Nk B , which is independent of 
temperature. The first theory to give the correct qualitative behavior of solids at 
all but very low temperatures was due to Einstein. He assumed that the solid 
was composed of N uncoupled harmonic oscillators. The thermodynamic 
properties of the Einstein solid have been computed in Exercises 7.1 and 7.5. 
The heat capacity of an Einstein solid goes to zero as T — ► 0 K and therefore 
gives a vast improvement over classical theory. It accounts for the fact that only 
the longest-wavelength phonons can be excited at very low temperature, 
whereas classical theory allows them all to be excited. However, Einstein theory 
gives a vastly oversimplified density of states. It assumes that all phonon modes 
have a single frequency and therefore that the density of states is 
g(io) = 3 N6(cu — uo), where ujq is the frequency of all the phonon modes. In 
real solids, the phonon modes have a distribution of frequencies, and this is 
more accurately taken into account by the Debye (continuum) theory. The heat 
capacity for the Einstein solid goes to zero exponentially with temperature and 
therefore does not give the correct low-temperature behavior for a solid. In Fig. 
7.3 we compare the density of states for the Einstein solid and the Debye solid. 

The actual density of states for some solids has been measured using neutron 
scattering techniques. Neutrons interact with the nuclei of atoms in the solid 
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ZN 



Fig. 7.3. Plots of the density of states of Einstein and Debye solids. 



Fig. 7.4. The density of states for aluminum, obtained from neutron scattering 
experiments. Reprinted, by permission, from R. Stedman, et al., Phys. Rev. 162, 549 
(1967). 


and are scattered by phonons, which are simply the normal modes of the atomic 
oscillations. In Fig. 7.4 we show the density of states for aluminum. The low- 
frequency contribution looks somewhat like the Debye result, but at high 
frequencies the Debye density of states is completely inadequate. It is 
interesting to note in Fig. 7.2 that the heat capacity of aluminum at high 
temperatures is not sensitive to the density of states. This is a result of the 
equipartition principle. At high temperatures the average energy of an oscillator 
is kgT and is independent of frequency of the oscillator. 
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7.F. ORDER-DISORDER TRANSITIONS 

An important class of systems that can be studied using methods of equilibrium 
statistical mechanics are those which exhibit a transition from a disordered to an 
ordered state. One of the simplest examples of such a system consists of a 
lattice of N spin-^ objects with magnetic moment p. Each lattice site can 
interact with the magnetic fields of their nearest neighbors and with any 
external applied magnetic fields that might be present. The Hamiltonian for 
such a system can be written 

N 

H = €ij Si Sj - pB ^2 s ‘i (7-75) 

m <-=i 

where denotes the sum over nearest-neighbor pairs ij (one must be careful 
to count each pair only once), e# is the magnetic interaction energy between 
nearest neighbors i and j, si is the z component of spin at the ith lattice site, and 
B is the external magnetic field. For spin-| objects, s t = +1(— 1) if the spin of 
site i is oriented in the positive (negative) z direction. There is no kinetic energy 
in Eq. (7.75). The Hamiltonian only contains information about spin orientation 
and the spatial distribution of lattice sites. If Cy < 0, then for B = 0 the 
lattice will have its lowest energy when all the lattice sites have spin up or 
all the lattice sites have spin down, both cases being equally probable. If 
B 0, then the configuration in which all lattice sites are oriented with spin up 
will be energetically favored. Similarly, if Cy>0, then for B — 0 the configu- 
ration in which neighboring spins are oriented opposite to one another will be 
favored. 

The partition function for this spin lattice can be written 


si\, (7.76) 


where Yh a ii CO nfi g . denotes the sum over all 2 N possible different configurations of 
spin on the lattice. The partition function introduces an additional influence, 
that of thermal energy, IcbT. While the magnetic interaction energy, H, will 
cause the spins on the lattice to become ordered, the thermal energy, k B T, will 
tend to randomize the spins on the lattice. It is these two competing influences 
which lead to an order-disorder phase transition on the spin lattice. At low 
temperature, the lattice will be ordered. As we raise the temperature, at some 
point the order disappears and the spins become randomly oriented. The 
system, described by the partition function, Eq. (7.76), is called the Ising model 
and was originally used by Ising [10] as a model for ferromagnetism. However, 
the model also has been used to describe the behavior of lattice gases, binary 
alloys “melting” of DNA, and so on. 


r 

N 

^ ^ exp CjjSjSj -j- 

all config. I {y} i=l 
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The Ising model can be solved exactly, and analytic expressions for its 
thermodynamic properties can be found in one and two dimensions, but no one 
has ever succeeded in solving it analytically in three dimensions. In one 
dimension it does not exhibit a phase transition at finite temperature, but in two 
dimensions it does. In one dimension the lattice does not have enough nearest 
neighbors for the ordering effects of the interaction energy to compete 
effectively with the disordering thermal energy. However, for two or more 
spatial dimensions it does. The Ising model was first solved in two dimensions 
by Onsager [1 1-13]. It is one of the few exactly soluble models which exhibit a 
phase transition. 

In this section we shall consider two simple versions of the Ising model. We 
will first obtain exact expressions for the thermodynamic properties of a one- 
dimensional Ising lattice. The one-dimensional lattice does not exhibit a phase 
transition at finite temperature. However, the method of solution contains some 
ideas which are used to solve the much more difficult case of a two-dimensional 
lattice. We will solve the two-dimensional case in Chapter 8. In this section we 
will also compute the thermodynamic properties of an Ising lattice with d 
dimensions in the mean field approximation. The mean field approximation of 
the ^-dimensional lattice does have a phase transition at finite temperature. 

7.F.I. Exact Solution for a One-Dimensional Lattice 

Let us consider a one-dimensional periodic lattice that consists of N lattice sites 
evenly spaced, as shown in Fig. 7.5. We will assume that all nearest neighbors 
have the same interaction energy, e,y = — e, so that the configuration with lowest 
energy is one in which the spins are totally aligned. The periodicity of the 
lattice is imposed by assuming that j ,-+ n = s,. The total energy for a given 
configuration, {$,}, is 




N N 

-ey^SjSj+i - 

i=l /=! 


(7.77) 


i 



Fig. 7.5. A one-dimensional periodic lattice with N 
lattice sites. 
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The partition function can be written 

r * 

MT,B)= E E ex P 

•Sl=±l . 1=1 

where we have used the fact that Y?Li s * = iSiiC 5 ' + * s /+i) f° r a periodic 
lattice. It is now convenient to introduce a 2 x 2 matrix, 


( CSiSi+i +~nB(si + s i+ 1 j 


(7.78) 


/ e 0(e+nB) e -/3e \ 

y e~P e eP^-v 8 ) ) ’ 


whose matrix elements are defined as 


(7.79) 




(7.80) 


The partition function may then be written 


z n (t,b) = Y ■■■ E <*i|rh>HP|*3}---{%iPki} 

.?i =± 1 sn = ±1 


= Y WP*l*i> = ^[P"] = K + A" = K 

S l=±l 


1 + 



(7.81) 


where A± are the eigenvalues of the matrix P. We shall use the convention 
A + > A_. The eigenvalues of P are easily found to be 


A± = e 0e | cosh (PfiB) ± yj cosh 2 ((3 fiB) — 2e~ 2/?c sinh(2/?e) . (7.82) 


In the limit N — ► oo, only the largest eigenvalue, A + , contributes to the 
thermodynamic quantities. This is easily seen if we note that the Gibbs free 
energy per site is 


^> B ) = (7.83) 

= -k B T ln[A+]. 

In Eq. (7.83), we have used the fact that lim A (- > oo(A-/A + ) jV = 0. Thus the 
Gibbs free energy per site is 

g(T , B) = — e — k B In fcosh(/fyt£) + y^cosh 2 (/fyjfi) - 2e _2 / ?c sinh(2/?e) j . 


(7.84) 
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The order parameter is given by 

0 = -(A)= ~r • ( 7 - 85 ) 

T ^ cosh 2 (p^B) - 2e~ 2 ^ e smh(2/3e) 

From Eq. (7.85) we see that the one-dimensional Ising model cannot exhibit a 
phase transition because when B —*■ 0 the order parameter also goes to zero. 
Hence, no spontaneous nonzero value of the order parameter is possible. The 
exact solution to the two-dimensional Ising model is givin in Chapter 8, and it 
does allow a phase transition. 


7.F.2. Mean Field Theory for a ^-Dimensional Lattice 

We can obtain approximate analytic expressions for the thermodynamic 
properties of a d-dimensional Ising lattice using the mean field approxima- 
tion first introduced by Weiss [14]. In the mean field approximation, the 
Hamiltonian of a ^/-dimensional spin lattice with N lattice sites can be written 

N N 

Si ~ v B Yl Si = - 

i=l i= 1 


y>(«,SK (7.86) 


1 N 

H = -~^ve{s) 

2 tr 


where e is the coupling constant, v is the number of nearest-neighbor spins, and 
E(e,B ) = \ ve{s) -(- i^B. The factor of | ensures that we don’t count the same 
pair of spins twice. The quantity ( s ) = (sj) is the average spin per site. The 
quantity ue(s)si is an average magnetic interaction energy between site i and its 
nearest neighbours, assuming that the neigbors all have spin (s). As we shall 
show below, the average spin per site, ( s ) , must be determined in a self- 
consistent manner. 

The partition function can be written 

Z N =(j2 e0ES = (2cos h{(3E)f. (7.87) 

The Gibbs free energy per site is 

g(e,B) = -k s T lim ( ^ln(Z w ) ] = -i s nn[2cosh(/3£:)]. (7.88) 

A/— *00 \yv / 


The probability P(si) that site i has spin 5 , is 

e /3Est e /3Est 

P(Sl) = E„.±i e 0E " = 2cosh(/3E) ' 


(7.89) 
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Note that the probability P(s,)has an especially simple form in the mean field 
approximation because coupling to nearest neighbours is taken into account in 
such a simple manner. 

The average magnetization of the lattice is given by 


<M) = N/jl{s), 


(7.90) 


where 


(s) = s '^e,i = tanh ! ;3£ ] = tanh [P M s ) + j ■ 


(7.91) 


The magnetization is the order parameter for the spin lattice. If B = 0, the 
magnetization will be zero for the high-temperature paramagnetic phase of the 
lattice (randomly ordered spins) and it will be nonzero at lower temperatures 
where the spins have spontaneously aligned. 

We can determine the critical temperature, T c , at which the lattice starts 
to become ordered as temperature is lowered (the Curie point) from the 
expression for the average spin per site, {s). Let us write (s) for the case 
B = 0; 


{s) = tanh | ^five(s) 


= tanh 


i/e(s) 

2 k B T 


(7.92) 


We must solve Eq. (7.92) for (s). This can be done graphically by plotting 
fa = (s) versus (s) and fa = tanh (a (s)) versus (s) on the same graph. The 
solution to Eq. (7.92) is given by those points where the two curves cross — 
that is, where fa =fa. In Figs. 7.6a and 7.6b, we plot fa and fa versus (s) 
for a < 1 and a>l, respectively. For the case a < 1, there is only one 
crossing point and it is at (s) = 0. For a>l, there are three crossing 
points, at (s) = 0 and at (s) = ±so- The free energy per site for these various 
cases is 


r-fc B 71n(2) 

\ —ksT In [2 cos h(^ (3veso)\ 


if(s) = 0, 
if(s) = ± s 0 . 


(7.93) 


Thus, the values, (s) = ±jo [when they are solutions to Eq. (7.92)] describe 
possible states of thermodynamic equilibrium since they minimize the free 
energy. The transition point (critical point) occurs at a = 1 in Fig. 7.6 and 
therefore when ve/lksT = 1. Thus the critical temperature in the mean field 
approximation is J = T c — ue/2k B - In Fig. 7.7, we plot the order parameter (s), 
versus T/T c . 

We see that mean field theory predicts a phase transition at a finite 
temperature for a ^-dimensional lattice. This does not agree with our exact 
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Fig. 7.6. Plots of /i = (s) versus (s) and f 2 = tanh(a(j)) versus (s). (a) a < 1. (b) 
a > 1 . 



Fig. 7.7. A plot of the order parameter, (s), 
versus the reduced temperature, T/T c . 


result for the case d = 1 (cf. Section 7.F.1) where we found no phase transition 
at finite temperature. As we shall see in Chapter 8, mean field theory gives too 
high an estimate of the critical temperature for spatial dimensions, d < 3. It 
gives good estimates for d > 4 which is not of physical interest but is of 
mathematical interest. 
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Let us next examine the behavior of the heat capacity in the neighborhood of 
the critical temperature. The internal energy for B = 0 is 


U = - 


1 dZ N 
Z N d(3 


-^Nue{s) 2 . 


The heat capacity is 


But 


C N 




= NIcbV€/3 2 (s) 

N 






= ve ( s ) 

V ) n 3 cosh 2 (s)^j — /fr'e 

The heat capacity finally becomes 

c NksfJWttf 2w B (j) 2 (r c /r) 2 

N 2cosh 2 (fi y (s) J — fiut [cosh 1 ((s)T c / T) — T c /T] 


(7.94) 


(7.95) 


(7.96) 


(7.97) 


(7.98) 


In Fig. 7.8 we plot the heat capacity as a function of temperature. We see that 
the heat capacity has a finite jump at the critical temperature. It reaches a 
maximum value of 3N kg at T — T c . 


C N 



Fig. 7.8. A plot of the heat capacity, Cn, as a function of temperature in the 
neighborhood of the critical point. 
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The final quantity we wish to compute is the magnetic susceptibility, 
Xt,n(B). The magnetic susceptibility is defined as 

»■*(*) (799) 


From Eq. (7.91) we can write 

'd(s) 


SB 


= sech 2 P^(s) + fiiiB j 
T N *-2 J 




2 V dB 


T,N 


or 




2/3/i 


/ t,n 2cosh 2 P^(s)+ /3/rfiJ — five 
The magnetic susceptibility, xt,n(B), is then given by 
r (R) 

Xt,n{B) . 7 ve 7 • 

2 cosh 2 1 p — (s) + /3jiB j — five 

The magnetic susceptibility in the limit B — 0 is 

2/3/V/i 2 2 Nr? {T c /T) 


Xt,n{ 0) = 


(7.100) 


(7.101) 


(7.102) 


2 cosh 2 j/3— (j)j — (3ve ve cos ^ {{s)T c /T) — T c /T 


(7.103) 



Fig. 7.9. A plot of the magnetic susceptibility, x(0) rjV , as a function of temperature in 
the neighborhood of the critical point. 
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with (s) given by Eq. (7.92). Examination of Eq. (7.103) shows that the 
magnetic susceptibility has an infinite jump at the critical point. In Fig. 7.9, we 
plot Xt,n(0) as a function of temperature. 

The results we have obtained in this section for the thermodynamic 
properties of a spin lattice in the neighborhood of the phase transition are 
qualitatively similar to the results we obtained in Chapter 3 using mean field 
theories. Using statistical mechanics to obtain these results allows us to 
compute various constants that appear in the thermodynamic expressions 
in terms of microscopic interaction energies and magnetic moments. A 
generalization of mean field methods has been used by Suzuki to study critical 
phenomena [15]. As we shall see in Chapter 8, where we give some exact 
results for the two-dimensional spin lattice and use renormalization theory, 
mean field theory gives a rough qualitative picture of the phase transition, but it 
is not quantitatively correct. 


7.G. THE GRAND CANONICAL ENSEMBLE 

An open system can exchange both heat and matter with its surroundings and, 
therefore, both energy and particle number will fluctuate. For an open system in 
equilibrium, we fix both the average energy, (E), and the average particle 
number, ( N ). To obtain the equilibrium probability density in such a fluctuating 
environment, we use the method of Lagrange multipliers to extremize the Gibbs 
entropy subject to the following three constraints. We require that the 
normalization take the form 


Tr(p) = 1. (7.104) 

We require that the average energy have a fixed value, ( E ), so that 

Tr (Hp) = (E). (7.105) 

And finally, we require that the average particle number have a fixed value, ( N ) , 
so that 


Tr (Np) = (AT), (7.106) 

where N is the total particle number operator. 

We can now find the probability density operator, p, which extremizes the 
Gibbs entropy subject to the constraints, Eqs. (7.104)-(7.106). The extremiza- 
tion condition is 


6{Tr[a 0 p + a E Hp + a N Np - k B pln(p)]} 

= Tr[{(ao — k B )I + &eH + — k B ln(/&)}<5p] = 0, 


(7.107) 
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where c*o, a E , and a N are Lagrange multipliers. Since bp is arbitrary, we have 

(c*o - k B )I + a E H + a N N - k B \n(p) = 0. (7.108) 

We can now use Eq. (7.108) and the three constraints, Eqs. (7.104)-(7.106) to 
determine the Lagrange multipliers. The normalization condition, Eq. (7.104), 
allows us to introduce the grand partition function, 


Z(a E , a N ) = exp 



= Tr 




H + ~N 

k B 


(7.109) 


and relates ao to a E and To determine a E and a^, let us multiply Eq. 
(7.108) by p and take the trace. We find 

— k B ln[Z(a£, aw)] + ol e (E) + ol^{N) +5 = 0. (7.110) 

If we compare Eq. (7.110) to the fundamental equation for the grand potential, 
Q = U — TS — p'N (cf. Section 2.F.5), we can make the identifications, 
a E = -l/T and a n = p'/T, and 

Q(7V) = -fc 5 Tln[Z M (T)]. (7.111) 


Equation (7.111) is the Fundamental Equation for the grand potential. It is 
written in terms of the grand partition function, which is defined as 


Z m (T) = = Tr(e"^"^), (7.1 12) 


with (3 = 1 jk B T. The probability density operator can be written 


p = e -p{fi-vtN-n) 


Tr (e-0(H-n'N)) ■ 


(7.113) 


Equation (7.113) is the probability density operator for the grand canonical 
ensemble. Equation (7.112) is the Fundamental Equation for an open system. 
While we have not written it explicitly, the grand potential will generally 
depend on a generalized displacement, X, whose form depends on the 
mechanical properties of the system being considered. For example, for a gas, 
X-V is the volume and for a magnetic system, X = M is the magnetization. 
Once we know the grand partition function, we can compute the grand 
potential, and from it we can obtain all thermodynamic quantities. The entropy 
is given by S = —(d£l/dT) x ,. The generalized force is given by Y = 
(dQ/dX) T fl ,. The average particle number is given by (N) = -( dQ./dp') T ,x • 
In exercise 7.6 we use the grand canonical ensemble to compute the 
thermodynamic properties of a gas of photons. 
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■ EXERCISE 7.6 A cubic box of volume V = L? contains electromagnetic 
energy (photons) in equilibrium with the walls at temperature T (black-body 
variation). The allowed photon energies are determined by the standing 
waves formed by the electromagnetic field in the box. The photon energies 
are Hujj = hcki , where fc, is the wavevector of the ith standing wave. Compute 
the pressure of this photon gas. (Note that since photons have no mass, the 
gas has chemical potential, p' =0.) 

Answer: Since there are in infinite number of standing waves allowed, the 
grand partition function is 

OO 00 / 00 \ oo / | 

Z A T ) = S ' ' • J2 ex P = II ( 1 _ g-^1 

n\= 0 n 00=0 \ 1=1 / /= 1 ' 

The grand potential is 

00 

n = -PV = k B TY, ln 0 - e'P**). 

i=i 


(i) 


( 2 ) 


We can change the summation to an integration in the following way. The 
standing waves have wavevectors, k = {n x Tr/L)\ + (n y 7r/L)j + (n z n/L)k. 
These correspond to allowed frequencies, u ;, such that 

If we imagine a lattice of allowed frequencies, u — ck, the spacing per point 
in uj space is c'k/L (see the figure below). The volume per point is (ctt/L) 3 . 
The number of points per unit volume is ( L/ctt ) 3 . 



The number, v of allowed frequencies less than some value u is therefore 

3 z,V 


HG™ 3 )© = 


(4) 


6c 3 7T 2 ’ 

For each allowed frequency of the electromagnetic field, there are two 
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transverse modes. Thus, = 2 \du — (L 3 /7r 2 c 3 ) J 0 °° u?duj. The pressure 
therefore becomes 


P=- 


k sT_ 

7T 2 C 3 


uj 2 duj\n(\ -e-^). 


If we integrate by parts, we find 

n 


P = 


3c 3 7T 2 


u?dcj- 


1 




- 1 45c 3 h 3 3 


where cr = 7r 2 k p /l5c 3 h 3 is the Stefan-Boltzmann constant. 


( 5 ) 


(6) 


In the grand canonical ensemble, the temperature, T, and chemical potential, 
/i', are fixed and the average energy, U = (E), and average particle number, 
(. N ), are fixed. However, because there can be a flow of energy and particles in 
and out of this system, it is important to know how large the fluctuations are. 
The derivation of the variance in energy fluctuations is similar to the derivation 
given in Section 7.D.I. Let us derive the variance in particle number 
fluctuations. We first write the normalization condition in the form 

Tr(e 0[ = i. (7.114) 


If we differentiate Eq. (7.114) twice with respect to //, we can obtain 


Thus, the fractional deviation behaves as 


\/w- {NY 


N 


- 1/2 


(7.116) 


As the average number of particles increases, the size of the fluctuations in 
particle number comes small compared to the size of the average particle 
number. Most microstates will have a particle number approximately equal to 
the average number and we retrieve the canonical ensemble. 

It is useful to write Eq. (7.115) in terms of the isothermal compressibility. 
From Eq. (2.6), (dN/dfj,') TV = —(dN/dV) T , (dV/dfi') T N . From Eq. (2.102), 
(. dpf/dV) TtN = -( dP/dNjjy . From Eq. (2.6), {dP/dN) TV = ~{dP/dV) T ^ 
(dV /dN) TP . If we now combine these equations and note that (dN/dV) T y — 
(i dN/dV) TP = ( N)/V since p! = / i'(T,P ), we find 

,„ 2 , T (N) 2 fd(V)\ k„T(N ) 2 

(N>) - (N) = k B T(- w )^ - k B T - — — KT - 


(7.117) 
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Thus, the variance in particle number fluctuations is proportional to the 
isothermal compressibility. Near a critical point, the compressibility can 
become infinite and, therefore, fluctuations in the particle number (or density) 
become very large. 


7.H. IDEAL QUANTUM GASES 


When a collection of identical particles forms a gas, the particles must be 
regarded as indistinguishable. Therefore, at low temperatures, where the 
thermal wavelength of particles becomes large and overlapping of wave 
functions occurs, the statistics of the particles plays a crucial role in 
determining the thermodynamic behavior of the gas. All particles which are 
known today obey either Bose-Einstein or Fermi-Dirac statistics. At very low 
temperatures, Bose-Einstein and Fermi-Dirac gases behave in completely 
different ways because their different statistics introduce different kinds of 
correlations between particles even though they do not interact via Newtonian 
forces. At high temperatures, the statistics play no role and all ideal gases, 
regardless of statistics, behave in the same way. 

When working with systems of identical particles, it is often easiest to use 
the grand cononical ensemble because then it is not necessary to conserve 
particle number. The grand canonical ensemble allows the number of particles 
in the system to fluctuate, but keeps the chemical potential fixed. This becomes 
especially useful when dealing with systems which can undergo a phase 
transition to a superfluid or superconducting state. For an ideal gas of identical 
particles each of mass m, the grand partition function can be written 

Z m (7\ V) = Trfe - ^ 0- ^], (7.118) 


where H 0 is the kinetic energy operator and total particle number operator N is 
the number operator. 

For an ideal gas, it is most convenient to evaluate the trace in the number 
representation since both the energy operator and the number operator are 
diagonal in that representation (cf. Appendix B). We shall assume that our gas 
is contained in a “box” with periodic boundary conditions. We will let the 
volume of the “box” be V = L x LyL z , where L x , Ly, and L z are the lengths of the 
sides of the box in the x, y, and z directions, respectively. The momentum 
operator for a single particle can be written p = hk, where k is the wave- 
vector of the particle. For a “box” with periodic boundary conditions, the 
momentum operator has eigenvalues 


P i = K 


27 rl x . Inly . 

7T ,+ V 



(7.119) 


where l x ,l y , and l z are integers each ranging from — oo to oo. Let us denote the 
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set of integers, l = (l x ,l y ,l z ). Each different choice of integers /*, l y , l z 
corresponds to a possible state of the system. The kinetic energy operator for 
a single particle is p 2 /2m and has eigenvalues 


A =f = f _ ( 4 ^ l x , 4 A 2 | 4 * % 2 ) 

2m Wl 2m\Ll + Lj + I 2 )' 


(7.120) 


Let «/ = ni x j y j z denote the number of particles with momentum pj. Then the 
kinetic energy operator can be written H 0 = £]/«/£/, and the number operator 
can be written N 0 = Yli «/• The summation, ]T] Z , ranges over all possible values 
of the integers l x , l y , and l z . 

The difference between Bose-Einstein and Fermi-Dirac particles lies in the 
numbers of particles that can occupy a given momentum (and therefore kinetic 
energy) eigenstate. For a gas of identical Bose-Einstein particles there is no 
restriction on the number of particles that can have a given set of quantum 
numbers, l x , l y , and l z . Therefore, the trace in Eq. (7.118) must include the 
possibility that any numbr of particles, «i, ranging from 0 to oo, can occupy the 
momentum state, pi. This will be true for each possible momentum state. Thus, 
for a Bose-Einstein gas the grand partition function is 


Z be (T, V,M') = n 

/ 


X! ex p( -Pni(ei - p!)) 

ni= 0 


( 7 . 121 ) 


In Eq. (7.121), the product IIi^JJLo) can be written more explicitly as 


# LAJ 1 

n E = E x-x E x E x E x-x E 

/ \/I/= 0/ «-oo, -00,-00=0 ”-1,0, 0=0 ”0,0, 0=0 ”1,0, 0=0 ” 00,00,00— 0 

( 7 . 122 ) 


In Eq. (7.121) we have not explicitly included the possibility that the particles 
have (integer) spin or other internal degrees of freedom. However, it is 
straightforward to include them. 

For a gas of identical Fermi-Dirac particles, again, each different set of 
quantum numbers, l x , l y , and l z , corresponds to a state of the system. However, 
for Fermi-Dirac particles the Pauli exclusion principle restricts the number of 
particles, n\, which can occupy a given state, 1, to n\ = 0 or n\ = 1. Thus, for a 
Fermi-Dirac gas the grand partition function becomes 


Zfd(T , V,m') = II 

/ 


1 

p( -Pm(ei - p!)) 

ni=0 


( 7 . 123 ) 


In writing the partition function for the Fermi-Dirac gas, we have not explicitly 
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included the (half-integer) spin or other internal degrees of the freedom of the 
particles. We will do this below in Section 7.H.2, where we describe the 
behavior of Fermi-Dirac gases in more detail. 

7.H.I. Bose-Einstein Gases 

The grand partition function for a Bose-Einstein ideal gas is given by 
Eq. (7.121). It implicitly assumes that the bosons have spin s = 0, so spin 
degrees of freedom do not have to be taken into account. If we expand the 
exponential into a product of exponentials, each depending on a different value 
of/, then we can rearrange terms in Eq. (7.121) and write the grand partition 
function in the form 


Z BE (T,V, t i') = „, M) 


From Eq. (8.B.52), the grand potential can be written 


ii BE (T, V,h') = -k B T\n[Z BE (T, V,p')] = keT^ ln(l - « 

/ 

(7.125) 

The average number of particles in the gas is 



where («/) is the average number of particles in the state / and is defined 

<"'> = = fear -)• ( ? - 127 > 

The quantity z = is called the fugacity. 

The ideal Bose-Einstein gas is particularly interesting because it can 
undergo a phase transition. We get our first indication of this by looking at the 
way particles are distributed among the allowed states. Let us first note that 
since the exponential e^ 1 can only have a range of values 1 < e^ £l < oo, the 
fugacity must have values 0 < z < 1. Otherwise, («/) could become negative, 
and that would be unphysical since («*) is an average particle number. Thus, for 
a Bose-Einstein gas the chemical potential, // = k B T\n(z), must be negative or 
zero. This means that it is “easy” to add additional particles to the gas. Let us 
notice something else about the average particle number, (n/). For the state with 
quantum numbers / = 0 = (l x = 0, l y = 0, l z = 0), the energy, eo = 0, and the 



384 


EQUILIBRIUM STATISTICAL MECHANICS 


average particle number, (no), can become very large as z — ► 1 , since 



(7.128) 


The fact that lim^i (no) = oo means that the state with zero energy can become 
macroscopically occupied as z — > 1. This is precisely what happens at the phase 
transition. 

Let us now compute some thermodynamic quantities for the Bose-Einstein 
gas. The summations in Eqs. (7.125) and (7.126) cannot be done explicitly, but 
for large enough volume, V, we can change the summation, ^ z , to an 
integration. For simplicity, let us assume the gas is in a cubic box so that 
L x = L y = L z = L and V = L 3 . We first compute the average particle number, 
(N), which from Eqs. (7.120) and (7.126) takes the form 


OOOOOO 

w- EEE 

l X = — 00 ly = ~0O l Z ~ — O 



(7.129) 


where p x = 2 irhl x /L, p y = 2irhl y /L, and p z = 2irhl z /L. It is useful to change the 
summations in Eq. (7.129) to integrations. However, the term in the sum with 
l x = 0, l y = 0, and l z = 0 can become infinite as z — ► 1. Therefore it must be 
removed from the sum before we change the sum to an integration. The allowed 
values of the momentum, p, form a grid of points located at p x = 2nhl x /L, 
p y = 2irhl y /L, and p z — 2irt il z /L in p space. If we explicitly remove the point at 
p x = 0, p y = 0, and p z = 0, and change from Cartesian to spherical coordinates, 
we obtain 


£' 


r' 


dl = 


(2irh)' 


f dp = ^^3 f 
J (27 rft) 3 J2nh/L 


p 2 dp. 


(7.130) 


The summation, anc ^ integration, J'dl, exclude the point at l x = 0 , l y — 0, 
and l z = 0. The integration, j' dp, excludes a sphere of radius p < 2i rh/L about 
the origin. We can now rewrite the average particle number as 


7 47tV 

W = t- L - + 


1 — z (27 rhy J2nh/L 


[" P 2 dp( 

J2nh/L V 


exp(/?p 2 / 2 m) 


( 7 . 131 ; 


Similarly, we can write the grand potential in the form 


n BE (T, V,p) = k B T\n(\-z) + 


47rVk B T 
( 2irh ) 3 


2t rh/L 


p 2 dp ln[l - zexp(-/?p 2 / 2 m)], 
(7.132) 


where we again have separated out the point at the origin. 
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It is useful to introduce dimensionless variables. If we let x 2 = (3p 2 /2m, the 
grand potential becomes 

n BE (T,V^) = k B T\ a (\-z)+^f/r Jdx\a{\ - ztT*] (7.133) 

'W 7r J\ T \/n/L 

and the average particle number becomes 

(N) = f [ xldx ( ~Zc2 Z (7.134) 

1 ~z KV^JXt^/l \e* -z) 


where 


A j = 


/ 2ttH 2 \ 1/2 
\mk B Tj 


(7.135) 


is the thermal wavelength. The thermal wavelength is the deBroglie wavelength 
of a quantum particle with kinetic energy k B T, and it is a measure of the spread 
of a wavefunction of the particles. As long as A t is much less than the average 
interparticle spacing, the quantum statistics of the particles will not have a 
significant influence on the thermodynamic properties of the gas. However, 
when \ T becomes of the same order as the average interparticle spacing, the 
thermodynamic properties of the gas will begin to depend strongly on whether 
the particles are bosons or fermions. 

Let us now write Eqs. (7.133) and (7.134) in terms of intensive variables and 
examine the volume dependence of various terms. From Eq. (2.122), the 
pressure can be written 


OfiE k B T k B T f \t\Zk 

p = -— = -—^-z)+^n(z)-h[z,— 


(7.136) 


where 


and 


A ^ 

85/2{z) = — 7 = X 1 dx ln[l - ze~^} = V] 

v 7T J'*' ^ 


a— l 


a 5/2 


1 1 \ * k * T r 2 j i ri 

Ip(z,a) = - 3 x axln[l 

Ae^tt Jo 


ze 


-X 1 ! 


(7.137) 


(7.138) 


In Eq. (7.137), the summation is obtained by expanding the integrand in powers 
of z and integrating each term. 
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We can also rearrange Eq. (7.134) to obtain an equation for the average 
particle density. 


(n) = 


(N) 1 z 1 


VI - Z At 


+ TT g3/2(z) 


2 , 


A r\/7r\ 

~y 


(7.139) 


where 


£ 3 / 2 ( 2 ) = z— £ 5 / 2 ( 2 ) 



E 

Q=1 


2 ° 

a yi 


and 


(7.140) 


7/i ( 2 , a) 


VttAj. 





(7.141) 


It is straightforward to show lim a _ > o7p(l ? «) = 0. and lim a _ + o7n(l, «) = 0. 
Therefore, in changing from the summation to the integration, if the singular 
term / = 0 had not been explicitly removed, its effect would have been lost. 

The quantities £ 5 / 2 ( 2 ) and # 3 / 2 ( 2 ), which appear in Eqs. (7.136) and (7.139), 
are well-behaved functions of z. We plot them in Fig. 7.10. Both £ 5 / 2 ( 2 ) and 
£ 3 / 2 ( 2 ) remain bounded and approach finite values, 

* 1.342- and gi/1 (\) = c(|) ~ 2.612 , (7.J42) 


as z — * 1, where £0 and £0 are Riemann zeta functions. 



Fig. 7.10. Plots of # 5/2 (z) and # 3 / 2 ( 2 ) versus z. 
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Fig. 7.11. Plots of (1/V)[z/(1 - z)] versus z for (A) V = 10, (B) V = 100, (C) 
V = 1000. 


Finally, let us examine the functions — (1/V) ln(l — z ) and (l/V)[z/( 1 — z)] 
which appear in Eqs. (7.136) and (7.139). In Eq. (7.139), the intensive variables 
(n) and T appear explicitly. Let us fix (n) and T and let V — > oo and z — > 1 . In 
the neighborhood of z = 1, we will write z = 1 - 1 /(n 0 V), where n 0 is a 
constant, so z — ► 1 in a systematic manner as V — ► oo. Then it is easy to see 
that 


^(-Imd-zfV))) = 0 and (7.143) 

Thus, in the limit V — > oo, we can neglect the contribution to pressure due to 
the term — (1/V)ln(l — z), but we must retain the contribution to the average 
particle density due to the term (1/V) [z/ (1 — z)]. Let us now examine the 
behavior of the function (1/V)[z/(1 — z)]- In Fig. 7.11 we plot (1/V) 
[z/(l — z)], as a function of z for increasing values of the volume, V. Note 
that for any given value V, it always becomes infinite as z — > 1. The larger the 
volume, the closer z must be to z = 1 before the term (1 /V)[z/(1 — z)] becomes 
significant. 

Let us now take the thermodynamic limit in Eqs. (7.136) and £7.139). We let 
V — > oo and (N) — > oo so that (n) — ( N)/V remains finite. The pressure then 
takes the form 


P = 


< 




k B T 

A \ 


£5/2(2) 


kaT 

Aj- 


£5/2(1) 


if z < 1, 


if z = 1 . 


(7.144) 
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The average particle density takes the form 

/KT\ { 71 ^ 3 / 2 ( 2 ) if Z < 1, 

(n) = (J = |A r i (7.145) 

[^0 +-^-g 3/2 (l) if Z = 1 . 

A somewhat exaggerated plot of (n)X\,noX\, and g 3 / 2 (z) as a function of z is 
shown in Fig. 7.12. The plot is actually done at large but finite volume, so the 
growth of no\j can be seen more clearly. The main contribution to the growth 
of the quantity (n)A^ comes from g 3 / 2 (z) until z approaches z = 1. Then as 
z — * l,g 3 / 2 (z) approaches a constant and «oA^ determines the further growth of 
(n)Xj. This is called Bose-Einstein condensation. What we are seeing as z — * 1 
is the macroscopic condensation of particles into lowest energy state of the gas. 
The number density, no, of particles in the zero energy state, s 0 , becomes 
macroscopic in size. 

In the thermodynamic limit, Bose-Einstein condensation begins to occur 
when the fugacity, z, equals 1 (or when the chemical potential (n)) = 
k B T\n(z) ~ 0). Therefore, Bose-Einstein condensation begins to occur when 
the temperature and average particle number satisfy the condition 

<"} 4 = *3/2(1) « 2.612 (7.146) 

[cf. Eq. (7.145)]. Equation (7.146) enables us to write the critical particle 
density, ( n) c (the particle density at which condensation occurs), as a function 



Fig. 7.12. Plots of (n) Ay, #3/2(2), and noXj versus z. (The contribution of ti§)X T for 
z < 1 has been exaggerated by taking V = 100 rather than V = 00.) 
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of temperature: 



£3/2(1) 

X 3 t 


2.612 



3/2 


(7.147) 


where (v) c is the critical volume per particle. The critical temperature, T c (the 
temperature at which condensation occurs), as a function of particle density is 
given by 


\3 _ £ 3 / 2 ( 1 ) 


' 2nh 2 \ ( (n) \ 2/3 _ / 2 t rh 2 \ / (n) \ 2/3 

^b) \£ 3 / 2 (l)/ ~ \mk B J \2.6\2J 

(7.148) 


The order parameter, 77 , for Bose-Einstein condensation is the fraction of 
particles in the condensed phase, 77 = no/ (n). From Eqs. (7.145) and (7.148), 
we can write 


_ n o _ 1 £3/2(1) _ t \j c _ t f T\ 3/2 

V <"> Xj \rj 


(7.149) 


A plot of the order parameter as a function of temperature is given in Fig. 7.13. 

Equation (7.146) also determines the shape of the coexistence curve between 
the “normal” phase and the “condensed” phase of the gas. From Equation 
(7.144), we see that for particle densities, (n) > (n) c ((v) < (v) c ), the pressure 



Fig. 7.13. A plot of the order parameter, 77 = no/{n), versus temperature, T, for Bose- 
Einstein condensation. 
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Fig. 7.14. A plot of the coexistence curve (dashed line) and some isotherms in the P — v 
plane for the phase transition in a Bose-Einstein gas. 


becomes independent of particle density. If we now use Eq. (7.147), we can 
write the critical pressure, P c , as a function of the critical volume per particle, 

( v )c : 


27r^ 2 g 5/2 (l) 1 

m(gy 2 (l)) 5/3 <v)f ' 


(7.150) 


A plot of the coexistence curve, Eq. (7.150), together with some isotherms in 
the P-v plane, is given in Fig. 7.14. The region under the dashed curve is the 
coexistence region, the region where both condensed particles and non- 
condensed particles can coexist. 

Another quantity of great interest in the neighborhood of a phase transition is 
the heat capacity. From Eq. (2.127), the entropy per unit volume is 
5 = ( dS/dV) T y = ( dP/dT) V y(s = (dS/dV) TfJt , only if the gas is composed 
of single type of particle). Therefore, from Eq. (7.144) we can compute the 
entropy and we obtain 

- k B (n)\n(z) if z < 1, 
r (7.151) 

2 a££V 2 ( 1 ) if z = 1, 

where we have made use of Eq. (7.145). Given Eq. (7.151) for the entropy 
density, we can now compute the heat capacity/volume at constant density, 
c n = T(ds/dT) n . It is important to note that in order to compute c„, we hold n 
fixed and not pi . The computation of c n requires the following quantity, 

S' 17,53 
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Fig. 7.15. A plot of the heat capacity per unit volume for a Bose-Einstein ideal gas as a 
function of the temperature. The temperature, T c , is the critical temperature for the onset 
of Bose-Einstein condensation. 


Equation (7.152) is obtained by differentiating Eq. (7.145) with respect 
to T holding (n) fixed. The computation of c„ is then straightforward. 
We find 


Cn — 



kB 'ijrgin(z) - (n)k B if z < 1 , 

^sTa^M 1 ), ifz=l. 


(7.153) 


In Fig. 7.15 we plot the heat capacity per unit volume for the Bose-Einstein 
gas. The location of the critical point is clear in the plot. Also, the Bose- 
Einstein gas clearly obeys the third law of thermodynamics. In the limit 
T — ► OK, the entropy approaches zero with temperature dependence, T 3 / 2 . In 
the high-temperature limit, the heat capacity approaches a constant value as we 
would expect for a classical ideal gas. In the high-temperature limit the effect of 
statistics becomes negligible. The phase transition in an ideal Bose-Einstein gas 
in entirely the result of statistics. As we shall see, the Fermi-Dirac gas exhibits 
no such transition. 

The high-temperature behavior of the Bose-Einstein gas is readily obtained. 
At high temperature, z -> 0 and g 5 / 2 (z) « £ 3 / 2 ( 2 ) « £ 1 / 2 ( 2 ) = z. From Eq. 
(7.145) we obtain 


, . (N) z ( mk B T \ }/ 2 

(n)= l F = 4 =Uf) ‘ 


,/3m 


for the particle density, and from Eq. (7.144) we obtain 

k B Tz (N)k B T 


P = 


\ 3 t 


(7.154) 


(7.155) 



392 


EQUILIBRIUM STATISTICAL MECHANICS 


for the pressure. From Eq. (7.153) we obtain 

15 k B z 9k B (N) 3 (N)k B 

Cn 4X 3 T 4V 2 V 


(7.156) 


for the heat capacity per unit volume. Thus, at high temperature the Bose- 
Einstein gas behaves like an ideal classical gas. 


■ EXERCISE 7.7. Compute the variance, ((N — (A/)) 2 ), in the number of 
particles for an ideal boson gas (below the critical temperature) in the 
neighborhood of T = OK. 


Answer: The variance in the particle number is given by 


((N - ( N » 2 } 


0 V )t,v 


(') 


The average number of particles in a boson gas below the critical 
temperature is 

W=T i_ + Z; g3 / 2 ( Z ), (2) 

where # 3 / 2 ( 2 ) = The variance is then 

(( N - (N)) 2 ) =JZT z + (j^) + ^S,/2(z) > (N) + (N) 2 , (3) 


For an ideal boson gas at low temperature, the particle number distribution 
has a huge variance so it is not possible to give good predictions about how 
many particles are in the system at any given instant. 


7.H.2. Fermi-Dirac Ideal Gases 

We shall now examine the thermodynamic behavior of a gas of identical, non- 
interacting spin s = \ particles of mass m. For simplicity, we shall assume the 
gas is in a cubic box so L x = Ly = L z = L and we shall assume periodic 
boundary conditions. When we include the effects of spin we must generalize 
the expression for the grand partition function given in Eq. (7.123). Spin -5 
particles can exist in two spin states, s z — ±\h. Therefore, each momentum 
state can have two particles, one particle for each of the two spin states, and still 
not violate the Pauli exclusion principle. We will let ni a denote the number of 
particles with quantum numbers l = (4, l y , l z ) an d spin a, where a =| (j) for 
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s z — +\h(-\K). The grand partition function takes the form 


00 / 1 


Zfd{T,VJ) = n( E E e 

/=0 \ MJ ,=0 ni 1=0 

=n( i +^-"' ) ) 2 - 




(7.157) 


The power of 2 is due to the fact that there are two possible spin states for each 
set of quantum numbers, l. If we are dealing with a gas of spin-5 fermions, then 
there will be g = 2s + 1 spin states for each value of / and the partition function 
is given by 


Z fd (T, V,/0 = no + (7.158) 

/ 

The grand potential is given y [cf. Eq. (8.B.52)] 

Sl FD (T,V,pt) = -k B T\n[Z FD (T, V, /*')] = ~Wg 

l 

(7.159) 

where g = 2 for spin-1 particles. The average number of particles in the gas is 




where (n{), the average number of particles with quantum numbers l, is defined 
as 


w = (^rr) = (^)- < 7 ' 16, > 

In Eq. (7.161) the quantity z = is the fugacity. For Fermi-Dirac particles, 
the average particle number has no possibility of diverging. The fugacity can 
take on the entire range of values 0 < z < oo, and the average particle number 
can take on a range of value 0 < (n{) < g. In Fig. 7.16 we plot («i) as a function 
of £[ at low temperature (solid line) and at T = OK (dashed line). We see that at 
low temperature the particles completely fill all the states with lowest energy. 
Only those states at higher energy are partly occupied. At zero temperature, all 
states below a cutoff energy, £/ = £/ = o, is called the Fermi energy. The 
momentum, py = y/2m£f, is called the Fermi momentum. The distribution of 
particles in momentum space at low temperature is like a “sea” with all the 
lower states filled with particles. Only particles in states near the “top” of the 
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Fig. 7.16. Plots of the average occupation number, («i), as a function of energy, E\, at 
very low temperature. The solid line is a plot for T > 0 K, and the dashed line is a plot 
for T = 0 K. p is the chemical potential. 


“sea” can change their state. For this reason this distribution of particles is 
called the Fermi sea. 

Let us now compute the thermodynamic properties of the Fermi-Dirac gas. 
For large enough volume, V, we can again change the summation, E /, to an 
integration, 


E 


4ttV 
(27 rh) : 


P 2 dp 


(7.162) 


[cf. Eq. (7.130)]. For the Fermi-Dirac gas there are no singular terms to remove 
from the summation before we approximate the sum, E /, by an integral. 
Therefore, the magnitude of the momentum, p, in Eq. (7.162) ranges from 0 to 
co. The grand potential takes the form 

n FD (T, V,p) = -PV = r P 2 dp W + ze< 3 ^ 2m ]. (7.163) 

{2'Kh) Jo 


Similarly, the average particle number takes the form 


(N) = 



(7.164) 


Let us now make the change of variables, x 1 = (3p 2 /2m, in Eqs. (7.163) and 
(7.164). Then the pressure of the Fermi-Dirac gas can be written 


Q fd gk B T 

/■=- — = ^r-/V2(4 


(7.165) 


where At is the thermal wavelength [cf. Eq. (7.135)] and the function f $/2 (z) > s 
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defined as 

4 f 00 , 00 -a 

/5/2(z) = ^J o Afrln[l+z^] = ^(-l) Q+l -^. (7.166) 

The average particle density can be written 

<">=^T = t|/3/2(z), (7.167) 

where 


/>/»(*) 


x 2 dx 


eT +z 


=£(->) 


a+1 


a=l 


a 


3/2 ' 


(7.168) 


In Fig. 7.17 we plot / 5 / 2 (z) and/ 3/ / 2 (z) as a function of z. 

It is interesting to compare the pressure of an ideal Fermi gas with that of an 
ideal classical gas and an ideal Bose-Einstein gas. In Fig. 7.18 we plot P versus 
the average volume per particle, v = (n)~ l and show one isotherm of the same 
temperature for each gas. For the Fermi-Dirac gas we set g = 1 so we are 
comparing only the effect of statistics. The pressure of the Bose-Einstein gas is 
dramatically lower than the classical or Fermi-Dirac gas for small v. This 
happens because in a Bose-Einstein gas at low v (below the critical volume per 
particle) a macroscopic number of particles condense into the zero momentum 
state and can no longer contribute to the pressure. The pressure of the Fermi- 
Dirac gas, on the other hand, always lies a little above that of the classical gas. 
This happens because for a Fermi-Dirac gas only one or zero particles can 
occupy a given momentum state, whereas in a classical gas any number can 
occupy a given momentum state. As a result, when comparing a Fermi-Dirac 
ideal gas to a classical ideal gas with the same particle density, (n) = v _1 , the 



fig. 7.17. Plots of fc/iiz) and / 3 / 2 (z) versus z. 
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Fig. 7.18. Plots of the pressure of a Fermi-Dirac (FD), a Bose-Einstein (BE), and a 
classical (Cl) ideal gas as a function of volume per particle assuming the particles have 
the same mass and neglecting spin. One isotherm for each gas is shown. The 
temperatures of the three isotherms are the same. 


Fermi-Dirac gas will contain more particles at higher momentum. The Fermi- 
Dirac gas will therefore tend to have a higher pressure than the classical gas. 

It is interesting also to examine the behavior of the quantity, ( n)\j - , as a 
function of the chemical potential, p! . In Fig. 7.19 we plot (n)Xj versus the 
product, j3p ', for both a Fermi-Dirac and Bose-Einstein ideal gas. We see that 
the chemical potential of the Bose-Einstein gas remains negative and that the 
dominant growth in (n)\j occurs as p' — ► 0. For the Fermi-Dirac gas the 
product, /V, can be positive or negative and the dominant growth in («)A) 
occurs when (3p ' is positive and growing in value. As we shall see below when 
we focus on behavior of the Fermi-Dirac gas in the neighorhood of T = OK, the 
chemical potential for a Fermi-Dirac gas approaches a positive finite constant as 
T — > OK. This is not evident in Fig. 7.19. We can revert the series expansion of 
(n)\j/g [cf. Eqs. (7.167) and (7.168)] and obtain a series expansion for the 
fugacity, z = e (j( \ which is convergent for sufficiently low density and high 



Fig. 7.19. Plots of (n)Xj- versus (3p for a Bose-Einstein (BE) and Fermi-Dirac (FD) 
gas. 
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temperature. We obtain 


.3*1 ( n ) X T , 1 

“ g g ) + 





(7.169) 


The coefficients of various powers of the quantity, {n)\\/g, will always be 
positive. Thus, as T — > oo, z — > 0 and the product, /?//, must be large and 
negative. Since (3 — > 0 the chemical potential /z — > — oo at high temperature. 
For low temperatures, z — > oo and (3fi' —> oo. Since (3 — >■ oo, in the limit T —* 0 
the chemical potential can remain finite and indeed it does. 

Let us now compute the thermodynamic properties of the ideal Fermi-Dirac 
gas at low temperatures. Let us first examine the behavior of / 3 / 2 (z) at low 
temperatures. We may write it in the form 


4 r 

h/i{z) = 

V* JO 


[^ 2 -^+l] “ V^Jo J [^ + 1] 


= ^ <% 3/2 
3vttJ 0 


(7.170) 


[1 +«>“ 


where we have let y = x 2 in the second integral and v = fl/i'. To obtain the last 
integral we have integrated by parts. The function A(y, u) = e y ~ v /[\ + e y ~ u } 2 , 
which appears in the last integral, is essentially the derivative of the occupation 
number («/), and at low temperature is sharply peaked about y = v — (3 }jl' where 
the strongest variation in («/) occurs (cf. Fig. 7.20). Thus, to perform the 
integration in Eq. (7.170), we may expand y 3 / 2 in a Taylor series about y = u. If 
we then let t = (y — u), we can write / 3 / 2 (z) as 

Mz) = 3^r/'(T?W +r ,/2 '+r 1/2,2 + ••)• (7 - 171) 
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The contribution from the lower limit in the integral will be of order e At 
low temperatures we can neglect it and extend the lower limit to — oo so that 

h ' l(z) = £ < *(T77j 5 + r' P,+ l v ~' lhl + ' ' ' (7 ' 172) 


To evaluate Eq. (7.172), we must evaluate integrals of the form 


In 


*00 

-00 


dt 


_jV_ 

(l+O 2 ’ 


(7.173) 


The result is /„ = 0 for n odd, 7 0 = 1, and /„ = («- l)!(2n)(l - 2 1- ' 1 )£(w) for 
n even, where £(«) is a Riemann zeta function [£(2) = £(4) = 

C(6) = ^], etc. 

We can use the above results to obtain an expansion for the quantity {n)\\/g 
which is valid at low temperature. From Eqs. (7.167), (7.172) and (7.173), we 
find 


("> y = 3^ [(/ V ) 372 + j (/ V )“’ /2 + •••]. (7.174) 

If we take the limit T —* 0 K in Eq. (7.174), we find the following density- 
dependent expression for the chemical potential 


fi'{T = 0 ) = = e F = 


2m 


6-7T 2 (n) 


2/3 


(7.175) 


The chemical potential, /io = Sf, at T = 0 K is also called the Fermi energy, 
because at T = 0 K it is the maximum energy that a particle in the gas can have 
(cf. Fig. 7.16). At very low temperatures, only particles within a distance, k B T, 
of the Fermi surface can participate in physical processes in the gas, because 
they can change their momentum state. Particles lower down in the Fermi sea 
have no empty momentum states available for them to jump to and do not 
contribute to changes in the thermodynamic properties. Equation (7.174) may 
be reverted to find the chemical potential a a function of temperature and 
density. The result is 


H' = £ F 


_ ^ (ksT 

12 \e F 



(7.176) 


Thus, the chemical potential approaches a finite constant as T — ► 0 K. In 
Fig. 7.21 we plot the chemical potential of an ideal Fermi-Dirac gas a function 
of temperature for fixed particle density. 
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Fig. 7.21. A plot of the chemical potential of a 
Fermi-Dirac gas as a function of temperature for 
fixed particle density. 


The internal energy, U = (H) = J2i £ i n h can be computed in a similar 
manner. At low temperature, it is given by 

H<^ i+ n(¥) 2+ -"]- (7 - 177) 


From Eq. (7.177) we obtain the heat capacity of the Fermi-Dirac gas in the 
limit T — ► OK. We find 


/df/\ _ (N)tt 1 k\T 

\dTj v ^ 2 £p 


(7.178) 


Thus, the heat capacity of an ideal Fermi-Dirac gas depends linearly on the 
temperature at very low temperatures and goes to zero at T = 0 K in accordance 
with the third law. It is important to note, however, that particles in an ideal 
Fermi-Dirac gas can have a large zero-point momentum and, therefore, a large 
pressure and energy even at T = 0 K. This is a result of the Pauli exchange 
principle. 

It is a simple matter to show that at high temperatures, all quantities 
approach values expected for an ideal classical gas. 


■ EXERCISE 7.8. Compute the variance in particle number, (( N — ( N )) 2 ), 
for a Fermi-Dirac gas for temperatures near 7 = 0 K. 


Answer: First note the thermodynamic identity, 

Near T = 0 K, we can write 


V g 



( 1 ) 

( 2 ) 
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where g is a multiplicity factor due to the degeneracy of spin states, and 
A j = yj 'hrh 2 / mksT is the thermal wavelength. From Eq. (2), we find 

w= M(vy /2 

' ' 3v^ \2irh 2 ) ' 

If we now take the derivative of Eq. (3), we obtain 


( 3 ) 


f d(N) \ _ 2gV/jL ,l / 2 m 3 / 2 
V dp' ) t,v \Zir(2irh 2 ) 3 / 2 


( 4 ) 


Let us now solve Eq. (3) for p' as a function of ( N ) and substitute into Eq. 
(4). We find 


1/3 


( d{N) \ _ y ™ ( 3g 2 ( N) \ 

\ dp' ) T y h 2 V 47r 4 V’ ) 


( 5 ) 


The variance then becomes 


<<»-TO•>-vvf(as) , '’ (6| 

The variance in particle number for a Fermi-Dirac gas at low temperature is 
much smaller than that of a Bose-Einstein gas. 


■ EXERCISE 7.9. Compute the density of states at the Fermi surface for 
an ideal Fermi-Dirac gas confined to a box of volume V. 

Answer: For simplicity we assume the box is cubic with volume V = L 3 . 
The momentum is 


Pi 



(i) 


where l x , l y , and l z are integers each ranging from — oo to oo. Let us denote 
the set of integers / = l x , l y l z . Each different choice of integers l x , l y , and l z 
corresponds to a possible state of the system. The allowed values of the 
momentum, p, form a grid of points located at p x = 2ixhl x /L, p y — 2nhl y /L, 
and p z — 2nhl z /L in p space. The volume per point in p space is (27 rh/L) 3 . 
The number of points per unit volume is (L/2irh) 3 . The number of points 
(states), n, inside a spherical volume of p space with radius less than p is 
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The energy of a particle with momentum p = hk is e k = h k 2 /2m. The 
number of states in the interval v — »• v + dv is 


dv 

dv = — — de k 
ds k 


m 3/2 Vy/£^ 

V2ir 2 h 3 


de k- 


The density of states at the Fermi surface is 


N(0) = 


( du\ n?l 1 V s /e ! 

V de k/t.tj V2n 2 h 3 


( 3 ) 

( 4 ) 


where 6/ is the Fermi energy. It should be noted that Eq. (4) is the density of 
state for a single spin component. 


^ SPECIAL TOPICS 

^ S7.A. Heat Capacity of Lattice Vibrations on a One-Dimensional 
Lattice — Exact Solution 

We can find an exact expression for the heat capacity of a one-dimensional 
lattice of harmonically coupled atoms. Let us consider a lattice of N atoms of 
mass m coupled to one another with harmonic springs, all of which have force 
constant, k. For simplicity, we will assume that the end masses are coupled to 
rigid infinitely massive walls by the same harmonic springs (cf. Fig. 7.22). The 
momentum and displacement operators for the 7 th atom are pj and qj, 
respectively (the displacement, qj, is measured relative to the left wall). When 
tlie lattice is completely at rest, the springs have length a. The distance between 
the walls is L = (N + 1 )a. The Hamiltonian operator is 

Nal) 2 , 

(7.179) 


- % - ° l i 1 ~ al ) 2 +f 


N - 1 


7=1 


7=1 


L = (N + l)a 

1 ; 

m » m — ''V ■ « ■ 

> 

97- 

Fig. 7.22. A one-dimensional lattice of N masses, m, coupled by harmonic forces with 
force constant k. The lattice is attached to rigid, infinitely massive walls at each end. qj 
measures the displacement of the yth mass from the left wall. The distance between 
walls is L = (N -1- 1 )a, where a is the spacing between masses when the lattice is at 
“rest” (for a quantum lattice, there will always be zero point motion). 
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where 1 is the unit operator. The position and momentum operators obey the 
commutation relations [qj,pf] = ihSjj , \pj,Pj>] = 0, and [qj,qj>] = 0. 

We can also measure the displacement, uj, of the jth atom relative to its 
equilibrium position. If we let qj =ja\+ Uj, the Hamiltonian operator takes the 
simpler form 


N p2 


N - 1 




2m 2 — 

j= 

Let us now introduce the interaction matrix, 



( 2 

-1 

0 •• 

• 0 

° ^ 


- 1 

2 

-1 •• 

• 0 

0 

V = - 

1 





m 

1 ° 

0 

0 •• 

• 2 

-1 


\ 0 

0 

0 •• 

■ -1 

2 / 


Then the Hamiltonian operator takes the form 

1 


H== 2 ~P T ip + ™u T Vu, 


(7.180) 


(7.181) 


(7.182) 


where p — {pi ,p 2 , • • • ,Pn ) and u = (u\ , m 2 , • • • , «w) are row matrices contain- 
ing the momentum and position operators, respectively, of the N atoms. The 
quantities p T and u T denote the transpose of column matrices and I is the unit 
matrix. 

The interaction matrix, V , is symmetric and therefore can be diagonalized by 
an orthogonal matrix, X. Since X is orthogonal it satisfies the conditions, 
X T ■ X = X ■ X T = 1. Because V is an N x N matrix it has N eigenvalues which 
we denote by a where a = 1, 2, ... N. Then 


X T ■ V • X = A, 

where A is the diagonal matrix 


A = 

(A 

0 

0 • 

^2 • 

•• 0\ 

•• 0 


\0 

0 • 



(7.183) 


(7.184) 


One can easily check that the eigenvalues are 

7ra 


u>l = 4u^sin 2 


2(N + 1)/ ’ 


(7.185) 
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where uo = \Jn/m is the natural frequency of a harmonic oscillator with force 
constant k and mass m. Matrix elements of the orthogonal matrix are 


*/,a = 


^ Tja 


N+ 1 \N + 1 


-sin 


(7.186) 


We can now use the orthogonal matrix to transform to normal mode 
coordinates, {Pa, Qa )• Let uj — i Xj a Q a and pj — X3a=i ^j>aP a* Then the 
Hamiltonian operator becomes 

A, = iE^ + lE“« ! & (7.187) 

Q = 1 Q=1 


Thus, the normal modes consist of a collection of uncoupled harmonic 
oscillators, each with different frequency, ui a . They correspond to standing 
sound waves on the lattice. Since the orthogonal transformation is also 
canonical, the commutation relations are preserved so that [Q a , P a >] = ih6 a)0t >, 
[Pa, Pa/ } = 0 and [0 Q , Q a ] = 0. 

We now have one last step in order to put the Hamiltonian operator in 
particularly simple form. Let us introduce the creation operator fit, and 
annihilation operator, a a , for quanta in the ath normal mode (sound mode), 


a 


a 



and 



(7.188) 


Note that Q a and P a are Hermition operators. 

Using the commutation relations for the normal mode coordinates, we can 
show that the creation and annihilation operators satisfy the commutation 
relations, [a a ,a\] = 1<5 QiQ '. The Hamiltonian operator then takes the form 

N N 

H = ^ huj a (a ] a a a +±i) = ^ hu a (h a + \ 1), (7.189) 

Q=1 a=l 

where h a = a\^a a is the number operator for energy quanta (phonons) in the ath 
normal mode, \n a ) is its eigenstate and h a \n a ) = n a \n a ). 

The partition function can be written in terms of the Hamiltonian, Eq. 
(7.180) or (7.189), since inside the trace we can use the orthogonal matrix to 
transform between them. We simply insert the identity, X • X T = 1, into the 
trace and obtain 


Tr;v {e-P*) = Tt n (X • X T • e' pk ) = Tr^ {X T ■ e~ pk ■ X) = Tr^ (e - ^ " *). 

(7.190) 
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Therefore, the partition function can be written 


Zn{T) = Tr^v 


ex P ( —fl ^ (h a + 


a— l 


JL e -m^ a /2 


= Utz 

Q=1 


-3hui a ' 


The average energy is 


_ _ dln(Z^) _ ^ fiujg A huj a 
\ d p ~ 2—1 2 r 

“ a=l a=l 

The average number of quanta in the ath phonon mode is 

= _ 1 ’ 

which is Planck’s formula. The heat capacity is 

" dT ' 


(7.191) 


(7.192) 


(7.193) 


(7.194) 


The summation cannot be done explicitly, but for large N we can change the 
summation to an integration to obtain a manageable form for the heat capacity 
as we shall show below. 

It is easy to see that at high temperature, Eq. (7.194) gives the correct 
classical expression for the heat capacity. First note that as T — > oo, (3hu) a <C 1 
and e^ a « 1 + f3huj a + . Then the lowest order term in an expansion in powers 
of / 3hu a is C/v = Nk B , which is the correct expression for the heat capacity of N 
classical harmonic oscillators on a one-dimensional lattice. 


^ S7.A.1 . Exact Expression — Large N 

If the number of atoms in the chain is very large, then in Eqs. (7.192) and 
(7.194) we can change the summation into an integration. The normal mode 
frequency, u, as a function of mode number, a, is 


u) = 2a>osin 


na 


2(N +1)/ 


(7.195). 


where 2o>o is the maximum allowed phonon (sound) frequency. Let us denote 
this “cutoff” frequency by u>l = 2uq. For N » 1 we can write 


N ,N ru L 2 N r“ L duJ 


= N, 


(7.196) 
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where 


, x da 2 N 

SM = ~ 7 ~ = 7 2 == =3 

1 T\Ju L l —U 2 


(7.197) 


is the density of states (obtained by differentiating Eq. (7.195) with respect to 
u)) and g(oj)dto is the number of phonon modes (not quanta) in the frequency 
interval u> — ► uj + dco. The average energy can be written 


j fWi ru>L 

{E) hcjg(co)du + Hun(u)g(u)du> 

2 Jo Jo 


Nfiu L 

7 r 


'U>l 

hun(u)g(u)du , 
o 


(7.198) 


where n(ui) = (e^ — 1) 1 is the average number of quanta in the phonon 
mode with frequency u. The quantity NHul/tt is the zero point energy of the 
lattice. In the limit T — ► 0, n(u) — > 0 and (E) — > Nhcj L /n. Thus, the quantum 
lattice always has some motion even at absolute zero kelvin. This is a 
consequence of the Heisenberg uncertainty relation, which does not allow the 
momentum of the masses to be zero if the masses are constrained to remain in 
the neighborhood of their lattice sites. 

^From Eqs. (7.194) and (7.196), the heat capacity can be written 


Cn = k B 



{Phujfe^ 
(eft” - l) 2 


2 Nk B r*- J (/ 3hu) 2 e ^ 

7t J 0 [ePnu - l) 2 y/u) 2 L -u 2 


(7.199) 


It is useful, however, to make one last change of variables. Let x = (3hu and 
define a lattice temperature, 77 = hui/k B . This is the temperature the lattice 
must have so the thermal energy, k B T, is equal to the largest allowed quantum of 
energy, hu L . The heat capacity then takes the form 


2 Nks r n/T 

C N = 

7T Jo 


dxx 2 ^ 


,0 (e*- 1 ) 2 sJ(T L /T) 2 -x 2 

In the limit T — * 0, the heat capacity becomes approximately 


where we have used the fact that 

’°° dx^e* 


*CX 

h = 

JO 


E 


n\ 


(^-l) 2 («+!)" 


(7.200) 


(7.201) 


(7.202) 
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(expand the integrand in powers of e~ x and integrate). For special values of n 
the sum can be performed. For example, I 2 = n/ 3 and U = 47 t 4 / 15 . Thus, at 
very low temperatures the heat capacity of this one-dimensional lattice goes to 
zero linearly with the temperature. Notice also that at very low temperatures the 
high-frequency (short-wavelength) phonon modes give almost no contribution 
to the heat capacity of the lattice because there is not enough thermal energy to 
excite them. 


^ S7.A.2. Continuum Approximation — Large N 

We will assume that we do not know the exact dispersion relation [Eq. (7.185)] 
for the phonon modes, and we will use the information we do have. We know 
that there are N atoms on the lattice, that it has length L, and that it is pinned on 
the ends. The allowed wavelengths for phonons on an elastic lattice of length L 
which is pinned at its ends are given by A Q = 2 L/ot, where a = 1,2,..., TV. The 
allowed phonon wavevectors are k a = 2n/\ a = na/L. The minimum allowed 
wavelengths are of the order of the lattice spacing, and therefore there is a 
cutoff frequency which we shall call the Debye frequency, u D . The dispersion 
relation for the phonons is u> a = ck a = c(na/L), where c is the speed of sound. 
The Debye frequency can be found as follows. The number of phonon modes is 
equal to the number of atoms, so 

N j*N L, C w ° Jin 

N = Y' 1 « da^\ g D (u)du = — du = —. (7.203) 

Q _i Ja=l JO CTTJo C7T 


Therefore, the Debye frequency is ujq = Ncn/L and the density of states is 




da L N 

du C7T Up 


(7.204) 


The density of states in the continuum approximation is a constant, whereas the 
exact solution gives a frequency-dependent density of states. In Fig. 7.23 we 
plot the exact density of states and the density of states obtained using the 
continuum approximation. The continuum approximation dramatically under- 
estimates the density of states near the cutoff frequency. 

The average energy is given by Eq. (7.198) if we simply replace the lattice 
density of states, g(u), by the continuum density of states, go(v). We then find 


(E) = + P hun{u)dcj. (7.205) 

4 u D ) 0 

The heat capacity is 

Nk B p {phufe^du _ Nk B T { Td/t dxx 1 ^ 

N u D Jo (e^ - l ) 2 t d Jo (e* - l ) 2 ’ 


(7.206) 
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Fig. 7.23. Plots of the exact density of states and the density of states obtained using the 
continuum approximation, both for the one-dimensional lattice. Here we take 

uj c = ujl — No- 


where in the second term we have again made the change of variables, x = (3huj, 
and have defined the Debye temperature, T D = hu D /k B . 

In the limit T -* 0, the heat capacity in the continuum approximation 
becomes 


C N 


Nk B T f°° dx jcV 

Td Jo (e* — l ) 2 


+ 0{T 3 ) 


NksT'K 1 

3T d 


+ o(t 3 ). 


(7.207) 


At very low temperatures the continuum approximation also gives a heat 
capacity for the one-dimensional lattice which goes to zero linearly with the 
temperature. The coefficient differs slightly from the exact result in Eq. (7.201). 


► S7.B. Momentum Condensation in an Interacting Fermi Fluid 
[15-1 7] 

An ideal Bose-Einstein gas can condense in momentum space and thereby 
undergo a phase transition, but an ideal Fermi-Dirac gas is prevented from 
doing so because the Pauli exclusion principle does not allow more than one 
fermion to occupy a given quantum state. Electrons in a conducting solid are 
free to wander through the lattice and form a Fermi fluid. At low temperatures 
the electrons form a Fermi sea and only those near the Fermi surface affect the 
thermodynamic properties of the electron fluid (cf. Section 7.H). The electrons 
experience a mutual Coulomb repulsion which is screened by lattice ions. 
However, as first noted by Frohlich [18], those electrons in the neighborhood of 
the Fermi surface also experience a lattice-phonon-mediated effective attraction 
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(two electrons may in effect be attracted to one another because they are both 
attracted to the same lattice ion). Cooper [15] showed that this effective 
attraction at the Fermi surface could cause bound pairs of electrons to form, and 
these pairs could then condense in momentum space, giving rise to a phase 
transition in the interacting Fermi fluid. Bardeen, Schrieffer and Cooper, [16] 
showed that this momentum space condensation of Cooper pairs is the origin of 
superconductivity in materials. In 1972, they received the Nobel Prize for this 
work. 

We shall now derive the thermodynamic properties of a Fermi fluid which 
can form Cooper pairs. It is found experimentally that Cooper pairs have zero 
total angular momentum and zero total spin. If the pairs are not undergoing a 
net translation through the fluid (no supercurrent), then we can assume that only 
those electrons at the Fermi surface with equal and opposite momentum and 
opposite spin components are attracted to one another. We shall assume that all 
other electrons behave like an ideal gas. With these assumptions, we can write 
the Hamiltonian of the electron fluid in the form 


H — ^2 + ^22 (7.208) 

k,A k 1 

where e k = h 2 k 2 /2m, and A denotes the z component of spin of a given electron 
and takes values A =| or A =[ (spin component or —\h, respectively). 
The operators, A and a kjA , respectively create and annihilate an electron with 
momentum hk and spin component A (cf. Appendix B). They satisfy fermion 
anticommutation relations. The interaction term in Eq. (7.208) destroys a pair 
of electrons with momenta h\ and —h\ and opposite spin components, and it 
creates a pair of electrons with momenta hk and —hk and opposite spin 
components. Since the electrons only experience an attraction at the Fermi 
surface, the interaction energy, V k) i, can be written 


j -Vo if \p! - e k |<Ae and \p! - ej| 
0 otherwise, 


(7.209) 


where Vo is a positive constant, p! is the chemical potential of the fermi fluid, 
and Ae is a small energy of order k B T. 

In order to simplify our calculations, we shall compute the thermodynamic 
properties of this interacting Fermi fluid in the mean field approximation. We 
write the Hamiltonian in the form 


Hmf = 22 £k ^M^k,A + 22 + X^ A k a k, r «-k,j’ (7.210) 

k,A k k 


where 


A k = 



if \p - £ k |^ Ae, 
otherwise 


(7.211) 
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and 


/ / 

A = —Vo ^(a_ kji a k)T ) and A* = -Vo ^(4,t a - M ). (7.212) 

k k 

The prime on the summation, J^' k , means that the summation is restricted to a 
distance, Ae, on either side of the Fermi surface. The average, (a_ k) ja k) f), is 
defined as 


(fi_ k!i a k)T ) = Tr[pfi_ k)i a k>T ], 


(7.213) 


where the density operator, p, is defined as 


e -0{H mf -vN) 

Tr[e~ P&mf-iiN)]' 


(7.214) 


The average, (fi kT ^ k j), is similarly defined. The number operator, N, is 
defined as 


* = (7-215) 

k,A 

The quantity A is called the gap function and may be real or complex. It is a 
thermodynamic quantity and is a measure of the average binding energy of all 
the Cooper pairs. If a macroscopic number of Cooper pairs form, then 
(^k,T^-k,i) ~ yfic and (4,t4 k ^) ~ y/nf where n c is the average number of 
Cooper pairs in the fluid. A is the order parameter for this transition. 

It is important to notice that the Hamiltonian, H m f, does not commute with 
the number operator, N, if A ^ 0. This means that if a macroscopic number of 
Cooper pairs form, the system does not conserve the particle (electron) number 
and the gauge symmetry is broken. The formation of a macroscopic number of 
Cooper pairs is a phase transition somewhat analogous to Bose-Einstein 
condensation (cf. Section 7.H). In both cases, gauge symmetry is broken. Since 
we are working in the grand canonical ensemble and only specify the average 
particle number, the fact that gauge symmetry is broken is not a problem. If a 
macroscopic number of Cooper pairs form, the total energy of the system is 
lowered. The transition to the condensed phase occurs when the thermal energy, 
kfiT, which tends to break Cooper pairs apart, becomes less important than the 
phonon-mediated attraction between electrons. 

It is useful now to introduce an effective Hamiltonian 

k = X^k(4,At - «-k,id-k,|) + J2 A k4,T«-ka’ 

k k k 


(7.216) 
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where 


£k — M — 


h 2 k 2 

2m 




(7.217) 


and we have made use of the fermion anticommutation relations. The effective 
Hamiltonian, K, differs from H m f — fiN only by a constant term. Therefore the 
density operator can also be written 


e -f3(H mf -liNj e (3K 

Tr ~Tr ‘ 


(7.218) 


The effective Hamiltonian, K, can be written in matrix form: 


where 


K = (7.219) 

k 


K k = 


( (k Ak \ 

Uk-^J’ 



4 = (4, a_ ka ). (7.220) 


As was first shown by Bogoliubov [19], the effective Hamiltonian, K, can be 
diagonalized by means of a unitary transformation which preserves the fermion 
anticommutation relations. In so doing, we obtain the Hamiltonian for effective 
excitations (called bogolons) of the system. To diagonalize the effective 
Hamiltonian, we introduce a 2 x 2 unitary matrix, 


U k = 




(7.221) 


Since 1/^4 =UilJl = 1 (unitarity), we must have |«k| 2 + |vk| 2 = 1. We also 
introduce the vectors 


f k = 



^k — (7k, o 7k,i)> 


(7.222) 


which are related to the vectors, a^, via the unitary transformation 


d k =U k f k . (7.223) 

The physical significance of the vectors, fk, will become clear below. It is easy 
to show that since A and a k ,A obey fermion anticommutation relations, the 
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operators, 7^ and 71^ (i = 0,1), must also obey fermion anticommutation 
relations 


[7k, i, 7^]+ = [ / 7k,i)7k',i']+ = [7k4> 7^]+ = (7.224) 


If we revert Eq. (7.223), we see that 7^0 decreases the momentum of the system 
by hk and lowers the spin by h (it destroys a particle with quantum numbers, 
(k, T), and creates one with quantum numbers, (— k, [), whereas 7^1 increases 
the momentum of the system by hk and raises the spin by h. 

We now require that the unitary matrix, 0 k, diagonalize the effective 
Hamiltonian, AT*. That is, 

UlK k U k =E k with£ k =( £ ^° (7.225) 

We find that £k,o = Ev. and £k,i = -£* with 

E k = \[ii + |A k | 2 . (7.226) 

With this transformation, we have succeeded in reducing the interacting Fermi 
gas of electrons to an ideal Fermi gas of bogolons. In terms of bogolon 
operators, the effective Hamiltonian becomes 

K = Y^4fiiUlK k U k ui<. S k = 7k A 7k 

k k 

= ^ y (^k,0 7k, 0 7k, 0 — ^k,l 7k, 1 7k, 1 + ^k,l)- (7.227) 

k 


The effective Hamiltonian, when written in terms of bogolon operators, looks 
like that of an ideal Fermi gas. The bogolons are collective modes of the system 
and play a role analogous to that of phonons in a Debye solid, although their 
dispersion relation is quite different. 

We can now obtain a self-consistent equation for the gap function, A. First 
note that 


(7k,o7k,o) — 


(1 -F e P Ek '°) 2 


1 — tanh 



and 


1 

(1 +e~P E ^) 


1 

2 


|l -t- tanh 



(7.228) 


(7k,i7k,i) = 


(7.229) 
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Then 


( a k«k) = f t . . 


1 - (7 k)0 7k,o) 0 


= U k 


0 


(7k,i7k,i) 


Ul 


_ 1 ? , 1 7 / tanh (/2£k/2) 0 \ f 

2 + 2 ^ V 0 -tanh ((3E k / 2) ) ° k 


1. 1 _ (E k 0 \_+ 

= 2 1 + 2^^( 0 -Ej k ^ hW ) 

= ^i+ 2 ^^k tanh (/9£ k /2). 


(7.230) 


Let us now equate off-diagonal matrix elements. We write 


-(a- k>i a k , T ) = ^r tanh (/^k/2) 


(7.231) 


If we multiply Eq. (7.231) by V k i , integrate over k, and use Eqs. (7.211) and 
(7.212), we obtain 


1 = V 0 tanh ((3E k / 2). 


(7.232) 


It is useful to n ote that un der the primed summation the bogolon energy can be 

written E k = yjti k + |A| 2 . Equation (7.232) is the equation for the gap function. 
It has been obtained from the grand canonical ensemble. Therefore, the 
solutions of the gap equation correspond to extrema of the free energy. The 
solution at a given temperature which corresponds to the stable thermodynamic 
state is the one which minimizes the free energy. Since the energy, E k , depends 
on the gap, Eq. (7.232) is rather complicated. 

Let us now determine some properties of the gap function from Eq. (7.232). 
If we assume that the system is contained in a large volume, V, we can change 
the summation to an integration [cf. Eq. (7.162)]. Note that 


£ 


V 

2n 2 


dkk 2 


m > l 2 V 
\/2 n 2 h 3 



(7.233) 


where we have Eg. (7.217). The summation, ]T} k , which is restricted to the 
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neighborhood of the Fermi surface, can be written 

w 3/2y fAe rAe 

p - 234) 

where we have set /i « Sf (gy is the Fermi energy) and N(0) = mVkf /In 2 h 2 is 
the density of states at the Fermi surface for a single spin state (cf. Exercise 
7.9). We can now write Eq. (7.232) as 


1 = V 0 N( 0) [ <*& L 

Jo V^ + |A(r)| 


tanh 




(7.235) 


Equation (7.235) determines the temperature dependence of the gap, A (T), and 
can be used to find the transition temperature. 

The energy of bogolons (measured from the Fermi surface) with momentum 

hk is E k = yj(£ + |A(T)| 2 . It takes a finite energy to excite them, regardless of 
their momentum, because there is a gap in the energy spectrum. At the critical 
temperature, T c , the gap goes to zero and the excitation spectrum reduces to that 
of an ideal Fermi gas. The critical temperature can be obtained from Eq. 
(7.235). It is the temperature at which the gap becomes zero. Thus, at the 
critical temperature we have 


1 - VqN (0) 


f^ tanh [f ] 

£k 


A(0)V 0 f 

Jo 


0 c Ae/2 tan h(jc) 

ax — 


= W(0)V 0 ln^/? c Ae|, 

where (3 C = ( k B T c )~ l , a = 2.26773, and we have used the fact that 


(7.236) 


tan ^fo) dx = \n(ab), (7.237) 

for b > 100. Thus, Eq. (7.236) holds when /3 c Ae/2 > 100. This means that 
A(0)Vo <0.184 and therefore use of Eq. (7.236) restricts us to fairly weakly 
coupled systems. From Eqs. (7.236) and (7.237) we obtain 

k B T c = | A ee~ l l N ^ v \ (7.238) 


for (3 C As /2> 100. Thus, the critical temperature, T c , varies exponentially with 
the strength of the attractive interaction. 
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We can also use Eq. (7.235) to find the gap, A(0) = Ao, at T = 0 K. Since 
tanh(oo) = 1, we can write 

f Ae 1 /As\ 

1 = VoMO) j o ^ k -=== i =vyv(0)siiih- 1 (— j, (7.239) 


or 


Ao 


As 

sinh (l/Vo^V(0)) 


lAee-'WW 0 . 


(7.240) 


The rightmost expression for Ao applies for weakly coupled systems when 
AT(0)Vo < 0.184. Comparing Eqs. (7.238) and (7.240), we obtain the following 
relation between the critical temperature and the zero temperature gap for 
weakly coupled systems : 


- = -= 1.764. (7.241) 

k B T c a 

Equation (7.241) is in good agreement with experimental values of this ratio for 
superconductors. Equation (7.235) may be solved numerically to obtain a plot 
of the gap as a function of temperature. The gap function is a real function for 
the case (such as we are considering here) when no supercurrent is present. We 
show the behavior of A (T) in Fig. 7.24 for weakly coupled systems. 

Since bogolons form an ideal gas, the entropy can be written in the form 

S = -2 k B ^[/ikln(n k ) + (1 - w k )ln(l - n k )], (7.242) 

k 

where « k = (1 + ^ £k ) -1 (cf. Problem 7.23). The heat capacity, C V)N , is easy to 



Fig. 7.24. A plot of the ratio A(7)/Ao 
versus the reduced temperature, T /T c , for 
a weakly coupled system. 
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find from Eq. (7.242). let us first note that for a Fermi gas at very low tempera- 
ture we have /i « e/, where e/ is the Fermi energy, and (dfj,/dT) v ^ ~ 0. Thus, 


Cy,N 





■1 + 2 ? 


1 ^|A k | : 


90 


(7.243) 


We can now examine the heat capacity, both at the critical temperature and in 
the limit T — > 0 K. 

Let us first look at the neighborhood of the critical point. The first term in Eq. 

(7.243) is continuous at T = T c , but the second term is not since d\A^\ 2 /d/3 has 
a finite value for T <T C but is zero for T > T c . Near T = T c , we may let 
£ k — ► |£k|. Then the heat capacity just below the critical temperature is 


C< N * 2 0 c k B £ U k + 2 A ( 


(7.244) 


T=T 


and just above the critical temperature it is 


dm 




(7.245) 


The discontinuity in the heat capacity at the critical temperature is 

AC -C< r> Q 2 b ^ nk d^ k D 

AC V ,„ - C V>{N) - C VfW - -P c k B 2_^ I ■ -Qp- j 3|£j 

^|A| 2 \ 


= -k B (3 z c N{ 0) 


d(3 ) 

/ T=1 


=m 


m 

dT 


(7.246) 


T=T C 


Thus, the heat capacity has a finite discontinuity at the critical temperature, as 
we would expect for a mean field theory. 

Let us now compute the heat capacity in the limit T — ► 0. As we can see 
from Fig. 7.24, the gap function, A, approaches a finite value, Ao, as T — ► 0 and 
dA/ dT — ► 0 as T — ► 0. As a result the heat capacity takes a fairly simple form 
in the limit T — ► 0. If we assume that \j! « Ef and A « A 0 in Eq. (7.243), then 
the heat capacity takes the form 


C v ^-20k B J2El^ = 20^ B j:E\ 


,0E k 


* (l+^ £k ) 


2 ’ 


(7.247) 
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where E k = y/^ + Aq. In order to change Eq. (7.247) it is useful to introduce 
the bogolon density of states. We can write 


E 


m 3//2 V 

y/2ir 2 h 3 


r °° 17? I 1 V 

o ^ k_ 7!^ 3 J 


’ LXJ r 

V' 

A 0 L V 




1/2 

v^Al' 


(7.248) 


For momenta, k & kf, the density of states is singular. Therefore, the dominant 
contribution to the integral comes from the neighborhood of the Fermi surface 
and we can write 


£«*((>) 


1,00 E k dE k 

A 0 V^k ~ A 0 


(7.249) 


Let us next note that in the limit T — » 0 we can write e^ Ev /(\ 4- e^ Ek ) 2 « e /3Ek . 
Thus, the heat capacity takes the form 


C V>N « 2/3 2 k B N{0) 


‘°° 

A 0 x/^k — ^0 


,-/3£k 


(7.250) 


The integral in Eq. (7.250) is easy to do if we use a trick. Note that 

EkdEk 


I = 


e ~ 0Ek = Aotfi^Ao), 


Iao \/E 2 - A 2 
where K\ is a modified Bessel function [19]. But 


E 3 dE k 


-0E k _ 

A 0 y/El-Af d(P 


d 2 I 1 


^ = = -A^[3^(/?Ao) + K 3 (PA 0 )]. 


Thus, the heat capacity takes the form 


1 


C v ,n = -P 2 k B N{0)A 3 0 [3ft (/?A 0 ) + K 3 {(3A 0 )}. 


(7.251) 


(7.252) 


(7.253) 


If we now ma ke use of the asymptotic form of the modified Bessel functions, 
K n (P Ao) ~ a/7t/2/?Ao e -/?A °, the heat capacity takes the form 


C VjN « v / 27r/? 3/2 ^flA^(0)Ao /2 e -/?A ° (7.254) 


in the limit T — > 0. Thus, the heat capacity of the condensed Fermi fluid goes to 
zero exponentially with temperature rather than linearly as in the case for an 
ideal Fermi gas. In Fig. 7.25 we show a sketch of the heat capacity of the 
interacting Fermi fluid (superconductor). The solid line is the Fermi fluid, and 
the dashed line is an ideal Fermi gas. 
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Cv,N 



Fig. 7.25. A sketch of the heat capacity for a superconductor. The straight dashed line 
gives the heat capacity in the absence of interaction (ideal Fermi gas). The solid line 
shows the jump in the heat capacity at the critical point and the exponential decay for 
temperatures below the critical point. 



Fig. 7.26. Variation of A /k B T c with reduced temperature, T/T c , for tin. The data points 
are obtained from ultrasonic acoustic attenuation measurements [20] for two different 
f requencies. The solid line is BCS theory. Reprinted, by permission, from R. W. Morse 
and H. V. Bohm, Phys. Rev. 108, 1094 (1954). 

The mean field theory gives a surprisingly good description of the behavior 
°f real superconductors. In Fig. 7.26 we show experimental measurements of 
fhe gap function, A, as a function of temperature for tin. The solid line is the 
tfiean field theory of Bardeen, Cooper, and Schrieffer. The experimental points, 
which are obtained from ultrasonic accoustic attenuation measurements [21], 
ht it very well. 
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► S7.C. The Yang-Lee Theory of Phase Transitions [22, 23] 


Yang and Lee have used very simple arguments involving the grand canonical 
ensemble to arrive at a mechanism by which a phase transition can take place in 
a classical fluid. We present it here because it gives valuable insight into the 
structure of the grand partition function. 

Let us consider a classical system of particles interacting via a potential, 

( oo if |ty| < a 

V(|qy|) = { -eg, (7.255) 

{ 0 if * < |%|, 

where q (y — q, — q y Because the particles have an infinite hard core, there is a 
maximum number of particles, M, which can be fitted into a box of volume V. 
Therefore, the grand partition function must have the form 


MT, V ) 



-0 



N(N-\)/2 

£ v(|q#) 

i <j=\ 


(7.256) 


where \ T is the thermal wavelength and Qn{T, V ) is called the configuration 
integral and is defined as 


e w (r,V) = jrfq w exp|-/3 ^ V(| % |)j (7.257) 

In the last term in Eq. (7.256), we have performed the momentum integrations. 
Since we are now dealing with classical particles, the phase space coordinates 
commute and the momentum integrations are trivial. Information about 
deviations from ideal gas behavior is contained in the configuration integral. 
For N > M,Q n (T,V) =0 because the hard core in the potential prevents more 
than M particles from occupying a box of volume V. 

Let us now introduce a new variable, y = e^fXj. Then, Eq. (7.256) can be 
written in the form 


M N 

Z,(T,V) =y2 y —Q N (T,V). (7.258) 

N = 0 N- 

We see that for finite volume, V, the grand partition function, ZJT, V), is a 
polynomial of order M in the parameter y. The coefficients of yr are positive 
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and real for all N. Since Z /i (r, V ) is a polynomial of order M, we can rewrite it 
in the form 


wvi=nf>-4 ( ? - 259 ) 

i= i v 

where the quantities y,-, are the M roots of the equation Z^T, V) = 0. Because 
the coefficients Qn/N\ are all real and positive, none of the roots y,- can be real 
and positive for finite M if we wish to satisfy the equation Z M (r, V) — 0. 
Therefore, for finite M all roots of y must be either real and negative or 
complex. If they are complex, then they must occur in complex conjugate pairs 
since Z /i (T, V) is real. As the volume is increased, the number of roots, M, will 
increase and move around in the complex plane. In the limit V — *• oo, it can 
happen that some of the roots will touch the positive real axis (cf. Fig. 7.27). 
When this happens, a phase transition occurs because the system can have 
different behavior for y < yo and y > yo, where yo is the value of the root on the 
real axis. In general, the pressure, P, will be continuous across the point yo, but 
the density and/or higher derivatives of P will be discontinuous (we give an 
example later). 

The pressure and density, in the limit of infinite volume, are given by 

^ = (7.260) 




Fig. 7.27. A schematic plot of the roots of 
Z M (T, V) = 0 in the complex y plane, (a) 
For finite V, no roots lie on the positive 
real axis, (b) For V = oo, roots can touch 
the positive real axis and separate regions, 
A and B, with different phase. 
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and 


; = = ^{ y ly'v H7 -‘‘ {T ' v)! ) ■ (7 ' 261) 

In general, the two operations limv—cc and y{djdy) cannot be interchanged 
freely. However, Yang and Lee proved that for the type of interaction 
considered in Eq. (7.255), the limit in Eq. (7.261) exist and the operations 
limy^oo and y(d/d y ) can be interchanged. The results of Yang and Lee are 
contained in the following theorems (proofs can be found in Ref. 22). 

Theorem I. For all positive real values of y, (1/V) ln[Z M (r, V)] approaches, as V — *■ oo, 
a limit which is independent of the shape of the volume V. Furthermore, the limit is a 
continuous monotonically increasing function of y. 

Theorem II. If in the complex y plane a region R containing a segment of the positive 
real axis is always free of roots, then in this region as V — > oo the quantities 

-ln[Z„(7\V)] and ( y-\ -In [Z M (7\ V)\ for n = 1, 2, . . . , oo, 


approach limits which are analytic with respect to y. Furthermore, the 
operations y(d/dy ) and limy-oo commute in R so that 

v)1 ) = y b “ » (y ln|z ' i(7V v)] )- 


Theorems I and II, together with Eqs. (7.260) and (7.261), enable us to 
obtain the following relation between v and P: 


1 

v 


d ( P \ 
y dy\k B TJ ' 


(7.262) 


They tell us that the pressure must be continuous for all y but that the 
derivatives of the pressure need only be continuous in regions of the positive 
real y axis where roots of Z M (T, V) = 0 do not touch the real axis. At points 
where roots touch, the derivatives of the pressure can be discontinuous. In 
general, if dP/dy is discontinuous, then the system undergoes a first-order 
phase transition. If a higher-order derivative is the first to be discontinuous, then 
the phase transition is continuous. 

If v is discontinuous at a point yo (where yo is a root of Z M ( T, V ) = 0), then it 
will decrease with y in the direction of increasing y. This can be proved as 
follows. Note that 
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The quantity, (( N — (N)) 2 ), is always greater than zero. Therefore, 1/v always 
increases and v always decreases with increasing y. 

Yang and Lee applied this theory of phase transitions to the two-dimensional 
Ising model [21, 23]. They found that for the Ising model the roots of 
Z^iT, V) = 0 all lie on the unit circle and close onto the positive real y axis in 
the limit of an infinite system. The point at which the roots touch the real axis 
gives the value of y for which the system undergoes a phase transition. 

It is of interest to consider an explicit example [24], We will consider a 
system with the following grand partition function: 


Z,= 


(1 +y) v (i -y v ) 

(i->) 


(7.263) 


where V is an integer. has V real roots at y = — 1 and it has complex roots of 
the form y = e lmk ' v , where k = 1,2, ... , V — 1. As the value of V increases, the 
density of roots on the unit circle increases and the roots get closer to the point 
y = 1 (cf. Fig. 7.28). 

The function P/k B T = limv/-, 00 (l/V / )ln[Z /i ] has different limiting values for 
y < 1 and y > 1 as V — > oo. We find 


P 

k^T 


lim —In 

V — *oo V 


\i+y) v (l-y v ) 

(! -y) 


ln(l+y) 
ln(l +y) + ln(y) 


if y < 1 
if y > 1. 
(7.264) 


Note that the pressure is continuous at y = 1. The volume per particle, v, is 
easily found from Eqs. (7.262) and (7.264): 


r y 

1 _ 9 ( p \ 

v y dy\k B Tj * 2y + 1 

, 0 + 30 ’ 


if y < i, 

if y > 1 . 


(7.265) 


♦ Im(y) 

• • • . 



Fig. 7.28. An example where the roots of the 
grand partition function lie on the unit circle in the 
complex y plane. 
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Fig. 7.29. A plot of P/k B T versus v for a system with grand partition function, 

— (1 + y) v (l -y) v /(l — y). 


Thus the volume per particle is discontinuous at y = 1 and we have a first-order 
phase transition. If we combine Eqs. (7.264) and (7.265), it is straightforward to 
show that the system has the following equation of state. 


P 





In [2] 


In 


v(l -v) ] 

(2v — l) 2 j 


if v > 2, 
if 2 > v > | , 

if 5>v>5- 


(7.266) 


In Fig. 7.29 we plot P/k B T as a function of volume per particle, v. We see that it 
has the behavior expected for a first-order phase transition. 
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PROBLEMS 

Problem 7.1. Compute the structure function for N noninteracting harmonic oscillators, 
each with frequency u; and mass m. Assume the system has total energy E. Using this 
structure function and the microcanonical ensemble, compute the entropy and the heat 
capacity of the system. 

Problem 7.2. A system consists of N noninteracting, distinguishable two-level atoms. 
Each atom can exist in one of two energy states, Eq — 0 or E\ = e. The number of atoms 
in energy level, E 0 , is n 0 and the number of atoms in energy level, E \ , is n\ . The internal 
energy of this system is U = noEo + n\E\. (a) Compute the entropy of this system as a 
function of internal energy, (b) Compute the temperature of this system. Under what 
conditions can it be negative? (c) Compute the heat capacity for a fixed number of 
atoms, N. 

Problem 7.3. A lattice contains N normal lattice sites and N interstitial lattice sites. The 
lattice sites are all distinguishable. N identical atoms sit on the lattice, M on the 
Interstitial sites, and N — M on the normal sites (N M 1). If an atom occupies a 
aormal site, it has energy E = 0. If an atom occupies an interstitial site, it has energy 
E ~ £ - Compute the internal energy and heat capacity as a function of temperature for 
this lattice. 

Pr oblem 7.4. Consider a lattice with N spin-1 atoms with magnetic moment p. Each 
at °m can be in one of three spin states, S z = — 1,0, +1. Let n_i,n 0 , and n\ denote the 
respective number of atoms in each of those spin states. Find the total entropy and the 
configuration which maximizes the total entropy. What is the maximum entropy? 
(Assume that no magnetic field is present, so all atoms have the same energy. Also 
as sume that atoms on different lattice sites cannot be exchanged, so they are 
distinguishable). 
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Problem 7.5. A system has three distinguishable molecules at rest, each with a 
quantized magnetic moment which can have z components +±/i or -±/i. Find an 
expression for the distribution function, f (i denotes the ith configuration), which 
maximizes entropy subject to the conditions Y2if = 1 and Xw M i,zf = 7M, where M i z is 
the magnetic moment of the system in the ith configuration. For the case 7 = 5 , compute 
the entropy and compute f. 

Problem 7.6. A fluid in equilibrium is contained in an insulated box of volume V. The 
fluid is divided (conceptually) into m cells. Compute the variance of enthalpy 
fluctuations, ((A Hi) 2 ), in the ith cell (For simplicity assume the fluctuations occur at 
fixed particle number, N t ). (Hint: Use P and S as independent variables.) 

Problem 7.7. A fluid in equilibrium is contained in an insulated box of volume V. The 
fluid is divided (conceptually) into m cells. Compute the variance of internal energy 
fluctuations, {(At/,) 2 ), in the ith cell (For simplicity assume the fluctuations occur at 
fixed particle number, At,-). What happens to the internal energy fluctuations near a 
critical point? 

Problem 7.8. What is the partition function for a van der Waals gas with N particles? 
Note that the result is phenomenological and might involve some guessing. It is useful to 
compare it to the partition function for an ideal gas. Remember that the particles are 
indistinquishable, so when using the partition function one must insert a counting factor. 
Use this partition function to compute the internal energy, U(N, T, V), the pressure, 
P(N, T, V), and the entropy, S(U, V,N). 

Problem 7.9. Consider a solid surface to be a two-dimensional lattice with N s sites. 
Assume that N a atoms (N a <sC N s ) are adsorbed on the surface, so that each lattice site 
has either zero or one adsorbed atom. An adsorbed atom has energy E = —e, where 
£ > 0. Assume the atoms on the surface do not interact with one another. 

(a) If the surface is at temperature 7, compute the chemical potential of the adsorbed 
atoms as a function of T, e, and N a /N s (use the canonical ensemble). 

(b) If the surface is in equilibrium with an ideal gas of similar atoms at temperature 7, 
compute the ratio N a /N s as a function of pressure, P, of the gas. Assume the gas 
has number density n. (Hint: Equate the chemical potentials of the adsorbed 
surface atoms and the gas.) 

Problem 7.10. Consider a two-dimensional lattice in the x-y plane with sides of length 
L x and Ly which contains N atoms (N very large) coupled by nearest-neighbor harmonic 
forces, (a) Compute the Debye frequency for this lattice, (b) In the limit 7 — ► 0, what is 
the heat capacity? 

Problem 7.11. A cubic box (with infinitely hard walls) of volume V = l) contains an 
ideal gas of N rigid HC1 molecules (assume that the effective distance between the H 
atom and the Cl atom is d — 1.3 A. (a) If L = 1.0 cm, what is the spacing between 
translational energy levels? (b) Write the partition function for this system (include both 
translation and rotational contributions). At what temperature do rotational degrees of 
freedom become important? (c) Write expressions for the Helmholtz free energy, the 
entropy, and the heat capacity of this system for temperatures where the rotational 
degrees of freedom make a significant contribution. 

Problem 7.12. An ideal gas is composed of N “red” atoms of mass m, N “blue” atoms 
of mass m, and N “green” atoms of mass m. Atoms of the same color are 
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indistinguishable. Atoms of different color are distinguishable, (a) Use the canonical 
ensemble to compute the entropy of this gas. (b) Compute the entropy of an ideal gas of 
3N “red” atoms of mass m. Does it differ from that of the mixture? If so, by how much? 

Problem 7.13. An ideal gas consists of a mixture of “green” and “red” spin-i particles. 
All particles have mass m. A magnetic field, B, is applied to the system. The “green” 
particles have magnetic moment 70 , and the “red” particles have magnetic moment 7 *, 
where 7 /? < 7 g- Assume the temperature is high enough that Fermi statistics can be 
neglected. The system will be in equilibrium if the chemical potentials of the “red” and 
“green” gases are equal. Compute the ratio N r /Ng, where N R is the number of “red” 
particles and N G is the number of “green” particles. Use the canonical ensemble (no 
other ensemble will be accepted). 

Problem 7.14. Consider a one-dimensional lattice with N lattice sites and assume that 
the ith lattice site has spin Sj = ±1. the Hamiltonian describing this lattice is 
H = -e^Zi s ‘ s ‘+ 1 - Assume periodic boundary conditions, so s N+l = 5 !. Compute the 
correlation function, (si^). How does it behave at very high temperature and at very 
low temperature? 


Problem 7.15. In the mean field approximation to the Ising lattice, the order parameter, 
( s ), satisfies the equation (s) = tanh((^) ^), where T c = ve/2 k B with e the strength of 
the coupling between lattice sites and v the number of nearest neighbors, (a) Show that 
(s) has the following tem perature dependence: (i) (j) « 1 — 2e~ 1Tc l r if T ~ 0 K, and 
(ii) (s) s v/3(l -T/T c ) if T ~T C . (b) Compute the jump in the heat capacity at 
T = T c . (c) Compute the magnetic susceptibility, xt,n{B = 0), in the neighborhood of 
T = T c for both T > T c and T < T c . What is the critical exponent for both cases? 

Problem 7.16. The density of states of an ideal Bose-Einstein gas is 



if E > 0, 
if E < 0, 


where a is a constant. Compute the critical temperature for Bose-Einstein condensation. 

Problem 7.17. An ideal Bose-Einstein gas consists of noninteracting bosons of mass m 
which have an internal degree of freedom which can be described by assuming, that the 
bosons are two-level atoms. Bosons in the ground state have energy Eq = p 2 /2m, while 
bosons in the excited state have energy E\ = p 1 /2m 4 - A, where p is the momentum and 
A is the excitation energy. Assume that A k B T. Compute the Bose-Einstein 
condensation temperature, T c , for this gas of two-level bosons. Does the existence of the 
internal degree of freedom raise or lower the condensation temperature? 

Problem 7.18. Compute the Clausius-Clapyron equation for an ideal Bose-Einstein gas 
and sketch the coexistence curve. Show that the line of transition points in the P-v plane 
obeys the equation 


p v 5/3 = 27Tfl 2 #5/2(1) 

m U3 /2 (i)) 5/3 ‘ 


Problem 7.19. Show that the pressure, P, of an ideal Bose-Einstein gas can be written in 
l be form P = au, where u is the internal energy per unit volume and a is a constant, 
(a) What is u? (b) What is a? 
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Problem 7.20. Electrons in a piece of copper metal can be assumed to behave like an 
ideal Fermi gas. Copper metal in the solid state has a mass density of 9gr/cm 3 . Assume 
that each copper atom donates one electron to the Fermi gas. Assume the system is at 
T = OK. (a) Compute the Fermi energy, e F , of the electron gas. (b) Compute the Fermi 
“temperature,” T> = 

Problem 7.21. The density of states of an ideal Fermi-Dirac gas is 



if E > 0, 
if E < 0, 


where D is a constant, (a) Compute the Fermi energy, (b) Compute the heat capacity at 
very low temperature. 

Problem 7.22. Compute the magnetization of an ideal gas of spin— fermions in 
the presence of a magnetic field. Assume that the fermions each have magnetic moment 
H e . Find an expression for the magnetization in the limit of weak magnetic field and 
T -> 0 K. 


Problem 7.23. Show that the entropy for an ideal Fermi-Dirac ideal gas (neglecting 
spin) can be written in the form 

5 = ~ k s £{<”i>M<*i>] + (1 - («i»ln[l - <«i)]}, 

l 

where(nj) = (g/^i-M) + l) -1 . 

Problem 7.24. To lowest order in the density, find the difference in the pressure and 
isothermal compressibility between an ideal boson and an ideal fermion gas. Assume 
that the fermions and bosons have the same mass and both are spinless. (Note: You are 
now considering fairly high temperature.) 

Problem S7.1. Show that near the critical temperature the gap function, A (T), in a 
weakly coupled, condensed Fermi fluid (superconductor) in the mean field approxima- 
tion has temperature dependence 


A (T) 
A(0) 


1.74 



1/2 


where T c is the critical temperature and A(0) is the gap function at T = 0 K. 

Problem S7.2. The unitary matrix, Vk ) , diagonalizes the effective 

/ ,r. a. \ V~ v k «k/ 


Hamiltonian = 


£k A k 

A k - £ k 


. Compute Vk and u k . 



ORDER-DISORDER TRANSITIONS 
AND RENORMALIZATION THEORY 


8.A. INTRODUCTION 

In previous chapters we have used mean field theory to construct a variety of 
models of equilibrium systems which undergo second-order phase transitions. 
In this chapter we will show how the idea of scaling leads to a universal theory 
of critical phenomena, and we will derive some exact results for order-disorder 
transitions. 

A useful way to obtain information about fluctuations in an equilibrium 
system is linear response theory. Experimentally, we can probe equilibrium 
systems with a weak external field which couples to the thermodynamic 
quantities of interest. For example, a magnetic field couples to the magnetiza- 
tion and allows us to probe magnetization fluctuations in a system. The way in 
which a system responds to an external field is determined by the type of 
fluctuations which occur in it. Indeed, as we shall see, the response function can 
be expressed directly in terms of the correlation functions for equilibrium 
fluctuations. In this chapter we shall use time-independent linear response 
theory to obtain a relation between the long-wavelength part of the equilibrium 
correlation functions and the static response functions. We will then use mean 
field theory to show that near critical points, fluctuations become correlated 
over a wide region of space, indicating that long-range order has set in. 

If we are to describe the thermodynamic behavior of systems as they 
a Pproach a critical point, we must have a systematic way of treating 
thermodynamic functions in the neighborhood of the critical point. Such a 
Method exists and is called scaling. We can write the singular part (the part 
a tfected by the phase transition) of thermodynamic functions near a critical 
Point in terms of distance from the critical point. Widom was first to point out 
•bat as the distance from the critical point is varied, thermodynamic functions 
change their scale but not their functional form. The idea of scaling can be 
expressed mathematically by saying that the thermodynamic functions are 
lom °geneous functions of their distance from the critical point. As we shall see, 
l he idea of scaling underlies all theories of critical phenomena and enables us to 
°btain new equalities between various critical exponents. The scaling behavior 
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of thermodynamic functions near a critical point has been verified experimen- 
tally. 

Kadanoff was able to apply the idea of scaling in a very clever way to the 
Ising model and in so doing opened the way for the modem theory of critical 
phenomena introduced by Wilson. The idea behind Kadanoff scaling is the 
following. As the correlation length increases, we can rescale (increase) the size 
of the interacting units on the lattice. That is, instead of describing the lattice in 
terms of interacting spins, we describe it in terms of interacting blocks of spin. 
We take the average spin of each block and consider it as the basic unit. As the 
system approaches the critical point, the correlation length gets larger and the 
block size gets larger in such a way that the thermodynamic functions do not 
change their form, but they do rescale. With this picture, Kadanoff was able to 
find a relation between the critical exponents associated to the correlation 
length of fluctuations and the critical exponents which were introduced in 
Section 3.H. 

Wilson carried Kadanoff’ s idea a step farther and introduced a means of 
computing critical exponents microscopically. Wilson’s approach is based on a 
systematic rescaling of the effective Hamiltonian which describes a system near 
the critical point. As the correlation length increases near a critical point, one 
repeatedly integrates out the effect of shorter-ranged correlations and requires 
that the Hamiltonian retain the same functional form. This leads to nonlinear 
recursion relations between the effective coupling constants on different length 
scales. The critical point corresponds to a fixed point of these recursion 
relations. The eigenvalues of the transformation matrix (linearized about the 
fixed points) which yields the recursion relations can be expressed in terms of 
the critical exponents. Therefore, if we can find the eigenvalues, the problem is 
solved. In this chapter we will show how Wilson’s theory can be applied to 
some simple models, such as the triangular planar spin lattice, and in the special 
topics section section we will apply it to the S 4 model. We will leave some 
examples for homework problems. 

Finally, as a special topic, we shall derive Onsager’s exact expression for 
the heat capacity of a two-dimensional square planar Ising spin lattice, and 
we shall show that this system does indeed have a critical point. The method 
we use to derive Onsager’s original result is not the same as that used by 
Onsager. Instead we follow a procedure developed by Kasteleyn and by Fisher 
using dimer graphs. The derivation of the Onsager result is then fairly 
straightforward. 


8.B. STATIC CORRELATION FUNCTIONS AND 
RESPONSE FUNCTIONS [1, 2] 

In this section we will investigate some general properties of static correlation 
functions and response functions in an equilibrium system. These will prove 
useful when we discuss scaling properties of equilibrium systems in subsequent 
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sections. We will first give some general relations and then use mean field 
theory to help build intuition about their qualitative behavior. 

8.B.I. General Relations 

Let us consider a system of N particles in a container of volume V at a fixed 
temperature T. We will assume that the ith particle has spin S,-, magnetic 
moment fi, momentum operator p f , and position operator q,. The magnetization 
density operator is defined as 

N 

m(r) ^(q/~ r ). (8-1) 

i= 1 

The total magnetization operator is 

f N 

M= drm(r) = fi V] h- (8.2) 

,= i 

If a magnetic induction field, B(r), is present, the total Hamiltonian may be 
written 


H = Hq — I fifrm(r) • B(r), (8.3) 

Jv 

where Ho is the unperturbed Hamiltonian: 

N 2 

Ho = E ^; + E < 8 - 4 ) 

1=1 (y) 

In Eq. (8.4), the first term on the right is the kinetic energy. The second term on 
the right is the interaction energy between particles, and the summation is over 
all pairs of particles. 

Let us now assume that the applied magnetic field, B, is constant throughout 
the volume, V. The average magnetization in the presence of this field is 


(M) b = 


Tr [e-^M] 
Tr [e~PH] 


(8.5) 


If we let M a denote the a* component of the magnetization operator, M, where 
— x ,y,z, then we can write for the static magnetic susceptibility 




' = (^| S 2 — ) = - (M a )(M a ,)) 

\ 015 a' J T,N,B = 0 

= 0{(M a - <*„»(*„. - (M a ,))). 


( 8 . 6 ) 
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Note that the susceptibility as we have defined it is independent of the applied 
magnetic field. It contains information about the thermodynamic properties of 
the unperturbed system. 

Let us now introduce the magnetization density fluctuation, Sm Q (r) = 
m a { r) - (m a ( r)). Then the static susceptibility can be written 


Xa,a' 



dr 2 (Sm Q (r 1 )Sm a >(r 2 )) = (3 


dr\ 


v 


<fr 2 C a , 0 /(ri,r 2 ), 

v 


(8.7) 


where 


C a , a '(r u r 2 ) = (6m a (r\)6m a > (r 2 )) (8.8) 


is the static spatial correlation function between magnetization density 
fluctuations at points r t and r 2 in the system. 

For systems with very large volume, V, we can neglect boundary effects if 
there are no spatially varying external forces present, and the static spatial 
correlation function will depend only on the relative displacement, r = rj — r 2 , 
of the two points in the system. Then we can write 

Ca.a'M = (<5m a (r)<5m Q /(0)). (8.9) 


The static susceptibility becomes 


Xa,a' = PV 


drC Q , a >{ r). 


( 8 . 10 ) 


It is useful to introduce yet another quantity, the static structure factor, 


(*a,a'(k) 


dr e tk ' r C a , a '( r). 
v 


( 8 . 11 ) 


This can also be written 


^a,a'00 — y 


dri ^r 2 e ,lt (ri r2) (<5m Q (ri)<5m Q /(r 2 )) 
Jv 


(8.12) 


= y (<5m Q (k)<5m Q /(-k)). 


In Eq. (8.12) we have made use of the Fourier decomposition of the spatially 
varying fluctuations 


6m a ( r) = -^]TV ,k ' r <5m Q (k). 

v k 


(8.13) 
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If we now compare Eqs. (8.7) and (8.12), we see that the static susceptibility 
can be written in terms of the infinite wavelength component of the static 
structure factor 


Xw = 0VG aiO '(k = 0). (8.14) 

Thus, we see that the way in which a system responds to a constant external 
field is completely determined by the long-wavelength equilibrium fluctuations. 
In Chapter 3 we found that the static susceptibility becomes infinite as we 
approach the critical point. Thus, Eq. (8.14) tells us that near the critical point 
the correlation function will have a large long-wavelength component, 
indicating that long-range order has begun to occur. 

8.B.2. Application to the Ising Lattice 

Consider now a three-dimensional Ising lattice with volume V. Assume that 
the lattice has N lattice sites. Let us break the lattice into blocks of volume A, 
so each block has n = (A )V)N spins. Let us assume the lattice has a 
temperature, T, which is above the critical point so the total magnetization 
is zero, (M) = 0. Let us assume that the z component of magnetization of 
the block is m/ (mi can be positive or negative and can have values ranging 
from mi = —n\i to mi — +n/i). Since the total magnetization is zero, we 
must have J2i( m i) = 0- If the block sizes are large enough so that n is large 
and each block achieves equilibrium locally, then we can also assume that 
(mi) = 0. 

Let us now analyze the system using Ginzburg-Landau mean field theory 
(cf. Section 3.G). We can write a phenomenological expression for the partition 
function 


Z n (T) = (8.15) 

{"*/} 

where 4>{mi] is the free energy density of the system, is the probability 

of finding the lattice in a configuration {mi}, and the summation is over all 
possible configurations of the lattice. Since we require that (mi) = 0, 0{m/} 
must be an even function of mi. 

If we assume that the lattice is very large, then we can let the discrete spatial 
variation of the local magnetization density become continuous, m a — > 6m(r). 
For small fluctuations away from equilibrium we can write 

${<5m(r)} = ;3a(T) + ^C,(T) f di(6m(r)) 2 

+ \c 2 (T) | dr(V6m( r)) . (V&»( r)) + ■ . ■ , 


(8.16) 
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where a(T) is the nonmagnetic free energy density. Let us next note that 

dr(8m(r)) 2 = V' 8m(k)6m(— k), (8-17) 

JV * 

where 6m(— k) = 8m* ( k) and 

dr(V6m(r)) • (V6m(r)) = -j- y~^k 2 8m(k)8m(— k). (8.18) 

.v V ^ 

The free energy can then be written 

0{&»C k} = MT) + ^T i (C>(T) + k 2 C 2 (T))Sm(k)6m(-k) + ■■■. (8.19) 

We can use this free energy to obtain the probability for a fluctuation, 6m (k), to 
occur. It is 


P(Sm(k)) = Cexpf-~(C, + * 2 C 2 )|<5m(k)| : 


( 8 . 20 ) 


where C is a normalization constant. With the probability density in Eq. (8.20), 
we can compute the static structure factor. We find 


G(k) = 


</[6m(k)]P(6m(k)) |6m(k) 
1 


= (IMk)l > = 


Cl + it 2 C 2 


( 8 . 21 ) 


The static susceptibility is given by 


X = PVG{ k = 0)=£-. (8.22) 

w 

Near a phase transition, the susceptibility behaves as x ~ (T ~ T c )~ l (cf. 
Section S3.C). Therefore, C\ « (T — T c ). 

The static correlation function is given by 


c ( r )=?E G ( k ) e "' kr = 

V k 


f d k e - ,lc r 
) (2 tt) 3 (Ci+k*C 2 ) 


t_ e ~ry/c^ 


(8.23) 


The correlation function has a length £ ~ y/CiJci- Since C\ « (T — T c ) near a 
critical point, the range £ « \J I (j — T~<) goes to infinity as (T — T c ) 1/2 as 
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we approach a critical point. Therefore, at the critical point the spatial 
correlations between fluctuations extend across the entire system. 


8.C. SCALING [1, 3] 

As we approach the critical point, the distance over which fluctuations are 
correlated approaches infinity and all effects of the finite lattice spacing are 
wiped out. There are no natural length scales left. Thus we might expect that in 
the neighborhood of the critical point, as we change the distance from the 
critical point (for example, by changing the temperature), we do not change the 
form of the free energy but only its scale. 

The idea of scaling underlies all critical exponent calculations. To under- 
stand scaling, we must first introduce the concept of a homogeneous function. 

8.C.I. Homogeneous Functions 

A function F(Xx) is homogeneous if for all values of A, we obtain 

F(Xx) = g(\)F(x). (8.24) 

The general form of the function g( A) can be found easily. We first note that 
F(Xfix) = g(Xfi)F{x) = g{X)g(n)F(x) (8.25) 


so that 


g(Xfj) =g(X)g(fi). (8.26) 

If we take the derivative with respect to fi, we find 

-^g(Xfi) = Xg'(Xfi) = g(X)g'(fi), (8.27) 

where g'(fi) = dg{n)Jdfi. We next set fi = 1 and g'(l) = p. Then 

Xg\X)=pg(X). (8.28) 

If we integrate from 1 to A and note that g(l) = 1, we find 


Thus, 


gW = A'. 


(8.29) 


F(Xx) = A p F{x) 


(8.30) 
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and F(x) is said to be a homogeneous function of degree p. A homogeneous 
function has a very special form. In Eq. (8. 30), if we let A = x~ x , we obtain 

F(x)=F( l)x p . (8.31) 

Thus, the homogeneous function F(x) has power-law dependence on its 
arguments. 

Let us now consider a homogeneous function of two variables f(x,y). Such a 
function can be written in the form 

f(\<’x,\«y) = \f(x,y) (8.32) 

and is characterized by two parameters, p and q. It is convenient to write /( jc, y) 
in another form. We will let A = y~ x ! q . Then 

f(x,y)=y Vq f(^j- q , i), (8.33) 

and we see that the homogeneous function, f{x,y), depends on jc and y only 
through the ratio x/y p / q aside from a multiplicative factor. We can now apply 
these ideas to thermodynamic quantities near the critical point. 

8.C.2. Widom Scaling [4] 

As we have seen in numerous examples, when a phase transition occurs in a 
system, part of the free energy begins to behave in a manner which leads to 
singular behavior in some of the thermodynamic response functions. If we 
assume that the “singular” part of the free energy scales, then we can find a 
relation between various critical exponents which agrees with experiment. We 
will consider magnetic systems since they give a simple picture, and we shall 
assume that a magnetic induction field, B, is present. Let us write the free 
energy per lattice site in terms of a regular part, g r (T,B), which does not 
change in any significant way as we approach the critical point, and a singular 
part, g s (£,B), which contains the important singular behavior of the system in 
the neighborhood of the critical point. Then 

s(7\B)=s r (7\B)+ ft ( e> B), (8.34) 

where s = (T — T c )/T c and T c is the critical temperature. 

We shall assume that the singular part of the free energy is generalized 
homogeneous function of its parameters, 

g s {\ p e,\‘ l B) = \g s {e,B). (8.35) 


We now write the free energy as a function of the magnitude of B. For 
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the systems we consider here, its direction does not play an important 
role. The critical exponents in Section 3H can all be determined in terms 
n f d and a. Let us first find an expression for (3, which is defined [cf. 
Eq. (3.85)] as 

MU,B = 0) ~ (8.36) 

If we differentiate Eq. (8.35) with respect to B, we obtain 

\ q M{\ p e,\ q B) = \M(e,B). (8.37) 

If we next let A = {-£)~ l/p and set B — 0, we obtain 

M( £ , 0) = (_ £ )< 1 -«>/i’M(-l,0). (8.38) 

Thus, 

0 = Ll? (8.39) 

P 


and we obtain our first relation. 

Let us next determine the exponent 6 (the degree of the critical isotherm), 
which is defined [cf. Eq. (3.84)] as 

M(0,fi) = |5| 1/l5 sign B. (8.40) 

If we set e = 0 and A = B~ x / q in Eq. (8.35), we can differentiate with respect to 
B and obtain 


M(0,B) = B^ l ~ q ^ q M(0, 1). 


(8.41) 


Thus, 


1 -q 


(8.42) 


and we obtain our second relation. 

The magnetic susceptibility is obtained from the thermodynamic relation 


X = - 


dB 2 ) T \ 


(-£) 


-i 


T < T c , 
T > T c . 


(8.43) 


differentiating Eq. (8.37) with respect to B, we can write 

A^x(A^,A^)-A X (e,B). 


(8.44) 
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If we now set B = 0 and let A = (e) 1//p , we find 


x(£, 0 ) = £ (1 2q V p x(l,0). 


Thus, the critical exponent for the susceptibility is 


7 = 


2q — 1 

P 


(8.45) 


(8.46) 


and we obtain our third relation between p and q and the critical exponents. By 
a similar calculation, we find that 7 = 7 '. 

The heat capacity at constant B is given by 

Cb = ~ T (0) ~ (£)"" (8-47) 

[cf. Eq. (3.86)]. From Eq. (8.35), we obtain 

X 2p C B {X p e,X q B ) = A C B (e,B). (8.48) 

If we set B = 0 and A = (e) -1/// \ we find 

C fl (e,0) = e( 1 - 2 ^C fl (l,0), (8.49) 

and therefore 

a = 2-- (8.50) 

P 

is our fourth relation. By a similar calculation we find a = a'. 

In Eqs. (8.39), (8.42), (8.46), and (8.50), we have obtained the four critical 
exponents, a , (3, 7, and <5, in terms of the two parameters p and q. If we combine 
Eqs. (8.39), (8.42), and (8.46), we find 

7 ' = 7 = /?($-l). (8.51) 

From Eqs. (8.39), (8.42), and (8.50) we find 

a + 0(6+1) = 2. (8.52) 

Thus, the Widom scaling assumption allows us to obtain exact relations 
between the critical exponents. These relations agree with mean field theory 
(a = 0, (3 = \, 6 = 3, 7 = 1 ) as one can easily check. They also agree with 
experimentally obtained values of the critical exponents which generally differ 
from mean field results (cf. Table 8.1). 
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For later reference, it is useful to express p and q in terms of the critical 
exponents. We find 


1 1 

£(<$+!) 


(8.53) 


and 



(8.54) 


The scaling property for systems near the critical point has been verified 
experimentally for fluids [5] and magnetic systems [1]. 


8.C.3. Kadanoff Scaling [6] 

Kadanoff has shown how to apply the idea of scaling to the Ising model. Let us 
consider a ^-dimensional Ising system with nearest-neighbor coupling (T 
nearest neighbors). The Hamiltonian is 

TN/2 N 

H{S\ = -K £ S,Sj -B^S„ (8.55) 

W ) (=1 

where N is the number of lattice sites. We will divide the lattice into blocks of 
length La, where a is the distance between sites (cf. Fig. 8.1). We choose L so 
that La <C £ where £ is the correlation length of spin fluctuations on the lattice 
[cf. Eq. (8.23)]. The total number of spins in each block is L d . The total number 
of blocks is NL~ d . The total spin in block I is 

$;' = ]•>. (8.56) 

iel 

Since L is chosen so that La <C £, the spins in each block will be highly 
correlated and it is likely that they will be aligned to some extent. In view of 
this, it is useful to define a new spin variable, Si, through the relation 

S; = ZS h (8.57) 

where S/ = ±1 and Z = L y . 

Spins interact with nearest-neighbor spins, so blocks should also interact 
with nearest-neighbor blocks. Thus, the block Hamiltonian will be of the form 

TNL- d /2 NL~ d 

H{S l } = -K l Y, SjSj-BlYSi 

(IJ) 7=1 


(8.58) 
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Fig. 8.1. Decomposition of a square lattice into square blocks whose sides have length 
La = 4 a. 

where Kl is the new effective interaction between nearest-neighbor blocks. The 
block Hamiltonian looks exactly like the site Hamiltonian except that all 
quantities are rescaled. Therefore, we expect the free energy per block, 
g{ELi B l ), to have the same functional form as the free energy per site, g(e,B). 
Since there are L d sites per block, we have 

g{e L ,B L ) = L d g(e,B). (8.59) 

If we rescale our system and describe it in terms of blocks rather than sites, we 
reduce the effective correlation length (measured in units of La) and therefore 
we move farther away from the critical point. Thus, the correlation length will 
behave as 

£l{£l,Bl) = L~ 1 £(e,B). (8.60) 

Since rescaling moves us away from the critical point, the temperature e and 
magnetic field B must also rescale. We assume that 

e l = eL\ (8.61) 

where x is positive. Similarly, 

N NL~ d NL~ d NL~ d 

bY. s ‘= b Y / Y, s ‘= b Y. s i' = bz Y. s '’ < 862) 

i=l /=1 iel 7=1 7=1 


so that 


B l = BZ = L y B. 


(8.63) 
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Equation (8.58) now becomes 

g(L x e,UB) = L d g(e,B). (8.64) 

If we compare Eq. (8.64) with Eq. (8.35), we find x = pd and y = qd. Thus, 

q < 1 (8.65) 


in agreement with experiment 

The Kadanoff view of scaling allows us to introduce two new critical 
exponents which are associated with the spatial correlations of spin fluctuations 
in the system. The block correlation function is defined 


C(r L ,e L ) = (S l Sj)-(S l )(Sj), ( 8 . 66 ) 

where tl is the distance between blocks / and J in units of La. We can write 
Eq. (8.66) as 

c(r t> £ t ) = z- 2 [(s;s;)-(s;)(si>] 

= (867) 

iel jeJ V ’ 

= Z~ 2 (L d ) 2 [{SiSj) - {Si)(Sj}) = Z~ 2 (L d ) 2 C{r, s), 


where r is the distance between sites i and j on different blocks. The distances r L 
and r are related by the expression 


II 

(8.68) 

and we write 


C(L~ l r, eL x ) =L 2{d ~ y) C(r,£). 

(8.69) 

If we choose L = r/a , the correlation function takes the form 



(8.70) 

^e can now introduce two new exponents for the correlation function. We first 
define a critical exponent, v, for the correlation length as 

C~(r-r c )-*'. 

(8.71) 


or mean field theories v = From Eq. (8.23) we see that the correlation 
function away from the critical point depends on r and e in the combination 
r /£ = re v . In Eq. (8.70) the correlation function depends on r and e in the 
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combination er x . Thus, 


x = pd = v-\ (8.72) 

At the critical point, e = 0 and the correlation function varies as 

C(r,0)~(rf y - d >. (8.73) 

In three dimensions, we expect the correlation function at the critical point to 
behave as 


C(r,0) 

where rj is another new exponent, 
dimensions, C(r, 0) varies as 

CM) = 

and we can make the identification 


1 

rv I — 

r 


1 +T) 


(8.74) 

For mean field theories, rj = 0. In d 


id- 2 +»?) 


(8.75) 


(d-2 + r]) = 2{d~y) = 2d(l - q) (8.76) 


Thus, the exponents for the correlation function can be written in terms of the 
exponents for the thermodynamic quantities we have already considered. From 
Eqs. (8.50) and (8.72) we find 


v = 


2 — a 
d 


(8.77) 


and from Eqs. (8.54) and (8.76) we find, 


f] = 2- 



= 2 - 


dy 

2(3 + 7 ' 


(8.78) 


Thus, Kadanoff scaling allows us to obtain two new critical exponents and new 
identities between all the exponents. 


8.D. MICROSCOPIC CALCULATION OF CRITICAL 
EXPONENTS [3, 7-11] 

The Kadanoff picture of scaling was given firm mathematical foundation by 
Wilson [12], who developed a technique for computing the critical exponents 
microscopically. We shall outline Wilson’s procedure for the case of spin 
systems, and then in Exercise 8.1 we apply it to a triangular planar lattice. 
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Let us consider a system described by the partition function 

Z(K,AT) = £exp[-jr(K,{S, },#)]. (8.79) 

«> 

The effective Hamiltonian, { 5,}, jV ) (which includes temperature), can be 

written in the form 

( 1 ) ( 2 ) 

Jf(K, {if}, AT) =K 0 + K,J2^ + ^Y, S ‘ S i + fC iH S ‘ S J 

<U) M (8.80) 

(1) ' ' 

+ 7 m ^ ^ SjSjSk + • • • , 

(»'../>*) 


where K is an infinite-dimensional vector containing all coupling constants, and 
the summation ]T^ means that only (ith) nearest neighbors are included. The 
coupling constants, AT,, contain the temperature. For the Ising model, 
K{ = ~/3B, K 2 = -(3 J where J is the strength of the coupling between spins, 
and K 3 = K 4 = • • • = 0. 

We can introduce blocks and sum over spins, cr h interior to each block. Thus, 
Z(K, N) = J2 txp[-je(K,{S,,a,},N)} 

TL”' 1 (8.81) 

= 53exp[-jr’(Kt,{S,},Att.'‘')]=Z(K I .,A7L-‘'). 

W 

Since the new partition function has the same functional form as the old one, we 
can write the following expression for the free energy density per site: 

*(K) = lim i lnZ(K, N) = lim ilnZ(K L ,NL~ d ) = L' d g( Kt). (8 82) 

N-+00 jy N — >oo /V 


The coupling constant vectors, K and K*,, of the site spin and block spin 
systems, respectively, will be related by a transformation, 


Kl = T(K), (8.83) 

where will be a vector whose elements are nonlinear functions of the 
c °mponents of K. Since our new Hamiltonian is identical in form to the old 
or >e, we can repeat the process and transform to even larger blocks nL. After n 

transformations we find 


k„ l = T(K (b _ 1)l ). 


(8.84) 
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If the system is not critical, there will be a finite correlation range. Thus, when 
we transform to larger blocks the effective correlation range appears to shrink 
and we move away from the critical point. However, when the system is critical, 
the correlation range is infinite and we reach a fixed point of the transformation. 
At the fixed point, the transformation T can no longer change the vector K. 
Thus, the critical point occurs for values of the coupling constant vectors, K*, 
which satisfy the condition 


K* - T(K*). (8.85) 

The sequence of transformations T is called the renormalization group 
(although T only has properties of a semigroup). The motion of the vector K 
in coupling constant space is similar to the motion of the vector describing a 
state point in phase space, except that the motion of K occurs in discrete jumps 
rather than continuously. 

It is useful to illustrate the possible motions of K for the case of a two- 
dimensional vector, K = (K\,K 2 ). Equation (8.84) can be thought to describe 
the motion of K in K space (the space of components of K) as we change block 
size. To locate a critical point in K space, we must locate the fixed points of Eq. 
(8.84). The procedure is identical to that used to find and classify the fixed 
points of classical dynamical equations. Let us assume that a fixed point of the 
transformation K L = T(K) occurs at K* = (A?, K^). We will want to know 
how the vector K moves in the neighborhood of the point K*. We must linearize 
Eq. (8.84) about K*. We will let 8K L = (K L - K*) and 8K = (K - K*). Then 
for small 8 K L and 8 K we get a linearized transformation 


<5 Kl = A • <5 K, 


where 


A = 


[ dK lL 
dKi 
dK2L 
\ dK\ 


dKu\ 
dK 2 
dK 2L 
dK 2 j 


*1 


(8.86) 


(8.87) 


We next find the eigenvalues and eigenvectors of the matrix A. Since A, in 
general, will be nonsymmetric, we must use the method of Section 5.C. The 
eigenvectors can be written 


8u l = A <5u, 


( 8 . 88 ) 


where A is the matrix of eigenvalues, 


A = 




J 


(8.89) 
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Fig. 8.2. A hyperbolic fixed point with its eigencurves and the flow of points in the 
neighborhood of the fixed point. 


and <5u is the right eigenvector, 


<5u = 



(8.90) 


The eigenvalues At and A2 of the matrix A determine the behavior of 
trajectories in the neighborhood of the fixed point. In Fig 8.2 we have drawn the 
case of a hyperbolic fixed point and its eigencurves. Points along the 
eigencurves move as 


SunL, 1 = (Ai)”<5«i, (8.91) 

<5«nL,2 = (A2)”<5«2- (8.92) 

Thus, for A > 1 the point moves away from the fixed point under the 
transformation, and for A < 1 it moves toward the fixed point. The dashed lines 
represent the trajectories of points which do not lie on the eigencurves. For a 
hyperbolic fixed point, they will always move away from the fixed point after 
many transformations. All systems with vectors K lying on an eigencurve with 
eigenvalue A < 1 are critical, since with enough transformations they will come 
arbitrarily close to the fixed point. Such systems are said to exhibit 
universality.” The behavior of a point along an eigencurve with A > 1 is 
reminiscent of the actual behavior of noncritical systems. As we increase the 
block size, we move away from the critical point. Thus, an eigenvalue A > 1 is 
called relevant and its eigenvector is identified as one of the physical quantities 
( £ or B, for example) which measure the distance of the system from the critical 
Point. 
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In general we write the singular part of the free energy density in terms of the 
eigenvectors <5«, and eigenvalues A, as follows: 


g s (6ui,6u 2 ,6u 3 ,...) =L d g s (\i8ui, \ 2 6u 2 , A 3 <5w 3 , . . .) (8.93) 


[cf. Eq. (8.82)]. This looks very much like Widom scaling. Indeed, for the case 
of an Ising system for which there are two relevant physical parameters which 
measure the distance of the system from the critical point, we expect that two of 

the eigenvalues will be relevant, let us say Ai > 1 and A 2 > 1 . 
Eq. (8.93) with Widom scaling of the Ising model, 

If we compare 


(8.94) 

we can make the identification 8u\ = e and 8u 2 = B. Thus, 


0 

II 

<■< 

(8.95) 


(8.96) 

and 



(8.97) 


If we now use Eqs. (8.53) and (8.54), we have expressed the critical exponents 
in terms of the relevant eigenvalues. 

In Exercise 8.1 we use Wilson’s theory to obtain approximate expressions for 
the critical exponents of a triangular planar lattice [13]. 


■ EXERCISE 8.1. Compute the critical exponents for 3-spin blocks on the 
triangular planar lattice for the two dimensional nearest-neighbor Ising 
model. Retain terms to lowest order in (V). 



(a) 
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Answer: The Hamiltonian can be written 

H = -Kj2 Si Sj-Bj2 s i- 0 ) 

y * 

We assign to a single block (block I in the figure below) a spin given by 
Sj = sign(s{ +s[ + $ 3 ). Therefore, Sj = +1 for the following configura- 
tions: 


(a = 1; m)(a = 2; UT)(a - 3; T U)(« = 4; TTT) 

The internal spin degrees of freedom are defined, — \ s \ + ^2 4* 

For configurations defined above we have: a} = a] = a) = 1, and cr? = 3. 

3 



(6) 


The partition function can be written 

Z(K l ,N) = ^exp[-ff(K t ,{5,})] = ££exp[-»(K, {S,,<x,})]. 

{ 5 ,} { 5 ,} {a,} 

(2) 

Now define 

H{ K, {5/, a,}) = H 0 (K, { S h a,}) + V(K, {5/, a,}), (3) 


where 


and 


H 0 (K, {S,, a,}) = -AT £ J2 J2 W 

I iei jel 


V(K, {5,, a,}) = s ‘ s i ~ B Y,Y1 s ‘- 

i€l j€J 1 i€l 


( 4 ) 


(5) 
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We next define the expectation value 

E», A({ S I< <T/})exp[-H 0 (K, {S,, n,})\ 


<M{S,})) = 

Now note that 


£ exp[-ff 0 (K, {S,.a,})] 


( 6 ) 


exp[~H(K L , {S;})] = ( ^ exp[-//o(K, {S,,tT,})} j (e v ) 


cri 


Y e~ Ho J (e< v > + 5[<^ 2 >-<^) 2 ] + ...). 

x °i ) 


(7) 


We can perform the summations in the unperturbed part to get 

[Zb(K)p = £exp[-ff 0 (K, {S,,<7,})], (8) 




where M is the number of blocks and Zo(K) is the partition function of a 
single block and is defined as 

Zb( K ) = Y eX P[*( S l S 2 + S l S 3 + *2*5)] = ^ + 3e ~ K ‘ ( 9 ) 

oi 

The interaction is Vjj — — K(s \s J z + 44)- The average value of a single 
spin index is 


(4) = M K ) 1 Xl 5 3 eX P[^( 5/ l 5 2 +5 2‘ S 3 +44)] 


°1 

= Zo(K)- l S,(e >K +e~ K ). 

If we take the logarithm of Eq. (7), we obtain 


(10) 


H(K l , B l , {5 7 }) = Min (Zo(K)) + (V) + - [(V 2 ) - (V 2 )] 


_ / -L e ~K \ 2 

= Mln(Zo(K))-2K^ e3K + 3e _ K j S,Sj (11) 


____ ( p3K _i p -K 

~ 3 Y1[^T3 lr«} SlB + 


where we have retained terms up to (V). Thus the Hamiltonian for the blocks 
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[ has been written in the form 

I 

H(K l ,B l ,{S,}) = -K l J2 s i s J - BlJ2 S '- (12) 

J IJ 1 

1 If we now compare Eqs. (1) and (12), we find 

/ e M+ e -K \ 2 

; (13) 

i 

| and 

I / e 3 * + e -K \ 

! = 04) 

I X ' 

I 

[ Fixed points occur for ( K * =0,5* =0) and for (K* = Kf,B* =0) where Kf 
| is the solution to the equation 

| 1 = ( e* + e~* \ z 

I 2 \e* K f + 3e- K f) ’ { } 

Equation (15) has eight solutions, only one of which is real and positive. 
This solution gives Kf = |ln(l + 2\/2) w 0.3356. Thus the fixed points 
occur for (K* = 0, B* = 0) and for (K* = 0.3356, B* = 0). 

Let us first consider the fixed point, ( K * = 0, B* = 0). If we let 
K = 6K and B = 8B and linearize Eqs. (13) and (14) in 6K and SB, we 
obtain 


6K l = 0.5 SK and SB L = 1.5<55. (16) 

This fixed point corresponds to temperature, T = ocK. 

Let us next consider the fixed point, ( K * = 0.3356,5* = 0). If we let 
K = 0.3356 + SK and 5 = SB and linearize Eqs. (13) and (14) in SK and 
<55, we obtain 


SK l = 1.6236AT and SB L = 2. 121 SB, 


(17) 


so the eigenvalues are A k = 1-623 and \ B = 2.121. The Widom scaling 
exponents become 

p= 2SSr a441 and (,8) 
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The critical exponents are 


a = 2 - - = -0.27, (3 = - — ^ = 0.72, 

P P 

2q — 1 q 

7 = — = 0.83, and <5 = — = 2.2 

p i - q 


(19) 


The exact solution of this two-dimensional Ising model yields (A^) exact = 1 .73 
and {^b) exac t = 2.80. Thus, we are close for A k but our calculation of \ B is 
not very good. It is possible to carry the calculation to higher orders in (V n ). 
In so doing, more elements of the vector, K, are introduced and better 
agreement with the exact results is obtained. 


► SPECIAL TOPICS 

► S8.A. Critical Exponents for the S 4 Model [3, 7, 8] 

Let us consider a ^-dimensional cubic Ising lattice with N lattice sites. The 
partition function can be written in the form 


m = £ exp K££ ^n^n+e (8.98) 

{*} \ •> e ) 

where K = f3J is the effective interaction and e is the vector indicating the 
positions of various nearest neighbors of the site, n. The summation is over all 
possible configurations of the lattice. We can change the summation to an 
integration if we introduce a weighting factor, W(S n ) = 6(S^ — 1). Then Eq . 
(8.98) can be written 


Z ( K )= IlP dS ^ W ( S ^ 

[m Loo 

The partition function is now in a form which allows some generalization. 

Let us choose the weighting function in a slightly different manner. We will 
let 

w(S n ) = <r ( * /2)5 “-<. (8.100) 

If we choose b = —4m, the W(S n ) takes the simple form 

W(S n ) = e- u(s «~ l}2 , 


exp 


K ^ ^ ^ ^ SnSn+e 


(8.99) 


(8.101) 
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and gives a fairly good approximation to the Ising partition function, Eq. (8.98) 
since it is peaked at S n = ±1 (cf. Fig. 8.3). With this weighting factor, the 
partition function takes the form 


z(K,b, U )= n [°° ds ' 

I m J-oo 


exp 


*EE 5 “ 5 "+'-EU 5 n + «S 4 


(8.102) 


The system now has an effective Hamiltonian 

f b 


H — K E E SnSn +* + E ( 2 + uS " 

n e n 

= 2 K E E( 5n +« ~ ^ n ) 2 + (j£ ~ 2d ) 


+«E 5 »’ 


(8.103) 


where the sum is restricted to positive values of e and ^ = d for a cubic 
lattice. The system described by Eq. (8.103) is called the S 4 model of the Ising 
system. 

It is useful now to rewrite the Hamiltonian and partition function in terms 
°f continuous variables. To do this we first introduce a spin field. Let S n denote 
the spin at point x = n a (we assume an infinitely large cubic lattice with 
spacing, a, between lattice sites). Then we introduce the Fourier amplitude, 

^(k), 


S(k) = a d S n e ik na . (8.104) 

n 

The components, k,(i = 1, . . . , d), of the wavevectors, k, can only take on 
Values in the range —§<&<< f since wavelengths smaller than the lattice 
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spacing are not allowed. It is also useful to note the identities 

/a\dr /a .. r /a . 


/a\dr/ a r/° 

(— ) dk !••• dk d e ikna = 6 nfi (8.105) 

V27r/ }-n/a J-n/a 


a d Y^ / e ikna = S(k). (8.106) 

n 

From Eq. (8.104), we can define a continuous spin field 

( 1 \ d t-nja rt/a 

— dk,-\ dk d <* x S( k). (8.107) 

i7r / J-7T /a J—Tt/a 

The spin field, S(x), is dimensionless. Note also that S(x = n a) = S n , but 5(x) 
need not be zero between lattice sites. 

Let us now rewrite the Hamiltonian in terms of the spin field, 5(k). First note 
that 

£L0W -S„) 2 = (ri-) [dk^S(k)S(-k)| e fl ‘“ - 1\ 2 . (8.108) 


= (^) j dk 2 J rfk3 | dk4 

x 5'(ki)5(k2)5'(k3)5'(k4)(5(ki + k 2 + k 3 + k 4 ). 


The Hamiltonion then takes the form 


\ / •* e 


x 5 , (ki)5(k 2 )5(k3)5(k 4 )5(ki +k 2 + k 3 +k 4 ). 


(8.109) 


(8.110) 


Near the critical point, spin fluctuations begin to exhibit long-range order. 
Therefore we only need to be concerned with long-wavelength (small- 
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wavevector) phenomena. If we note that |e ,k ea — lj 2 ~ k 2 a 2 for small k (note 
that 1^ ' e | 2 = ^ 2 )’ then the Hamiltonian can be written 


»-Ks) 




dk(Ci + C 2 /k 2 )|S(k)| : 

Ad 


dk\ 


dk 2 dk 2 1 


d k4 


(8.111) 


x 5(ki)5'(k2)5(k3)5'(k4)<5(ki + k 2 + k3 + lLt), 


where 


-2dJKcT d . (8.112) 

It is useful to rescale the spin fields. We will let S'(k) = S(k)(Ka 2 ~ d ) 1 ^ 2 , 
r = (b/K — 2d)aT 2 , and u' = u(Ka 2 ~ d )~ 2 . The Hamiltonian then becomes 


_ * 0 ,- 4+ 2 


C 2 = Ka 


and 


C, = 1 1 


H(r, u\ {S'}) = \ Q-) j dk(r + ^)|S'(k)| : 


+ h'(;L^ |dki|dk2 dk 3 Jrfk4 
x S'(ki)S'(k 2 )S'(k 3 )S'(k 4 )«(ki + k 2 +k 3 + It,)- 


(8.113) 


The partition function is given by 

Z(r, «',{$'}) = |DS'e" (r ’“''f s ’i ) , (8.114) 

where DS' now denotes a functional integral over values of the spin variables, 
^'(k). Equation (8.113) is the starting point for the renormalization theory, for 
the S 4 model. When u' = 0, the model is called the “Gaussian model”. In order 
to find a scaled Hamiltonian for the system, we follow a procedure similar to 
that for the triangular lattice in Exercise 8.1, but now we work in wavevector 
space and not real space. 

Our first step is obtain a block Hamiltonian. In order to do this, we will 
introduce a scaling parameter. L > 1, and divide the momentum intervals 
0 < ki < n/a into short-wavelength intervals ir/La < ki < ir/a and long- 
wavelength intervals 0 < ki < w/La. We then separate the spin fields into 
their long-wavelength and short-wavelength parts: 

S'(k) = y L (k) for 0 < ki < f-{i = 1, . . . ,d) 


(8.115) 
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and 

S'(k) = 0i(k) for ^-<ki <-(;= (8.116) 

La a 

From Eq. (8.107), we introduce the corresponding spin fields ^l(x) and 07, (x). 
The spin field «^l(x) is defined as 

( 1 \ ^ f VLj 

— ••• rfk t ^ L (ky 1tx (8.117) 

2 V J -n/La 

and is slowly varying over a region the size of the block, while the spin field 
07, (x) is defined as 

fft(x) = ( 2 b) j (8.ii8) 

where ^dk = ^ dk + dk and 07, (x) is rapidly varying inside a block. 

We now decompose the quadratic part of the effective Hamiltonian into its 
long-wavelength and short-wavelength parts and define the quantities 

1/1 \ d f 1r / La 

^o({^i},r)=- — dk(k 2 + r)y L (k)y L (-k), (8.119) 

2 \ 2 V J-n/La 

1 / 1 \ d 717,0 

jfo ({ol}, r) = 2 j dk (k 2 + r)<r L (k)a L (- k), (8.120) 

and 


/ i \ 4d r rV« 

V({Sf L ,a L },u') = u'^—J j---j rfk 1 rfk 2 ^k 3 ^k 4 


x 5 , (k 1 )5'(k 2 )5'(k 3 )5'(k 4 )(5(k 1 + k 2 + k 3 + k 4 ), 

(8.121 


where ^ L ( k) and 07,(k) are defined in Eqs. (8.115) and (8.116). We now can 
introduce an expectation value defined as 


m^ L })) = 


\Da L e-*°^A{{a L ,<? L }) 

jDcrie - ^ 0 ^^) 


( 8 . 122 ) 


and write the partition function in the form 
Z(r,u') = 

= if Da L e-^ o{{tTL}) ] f Dy L e-^ o({yL}) - (v ^ 


(8.123) 
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where we have introduced a cumulant expansion are before. We are only 
concerned with those parts of the partition function which contribute to the 
singular part of the free energy. Thus, the term in brackets, which comes from 
short-wavelength variations, can be neglected and the effective Hamiltonian 
takes the form 




d fir /La 

dkik 1 + r)^ L (k)^ L (-k) 

(8.124) 


+ (v)-^[(v 2 )-(v) 2 ] + 


After a scale transformation, we seek a new Hamiltonian of the form 
= P dk L (k? L + r L )y L (k L )y L (-k L ) 

•ir /a 

dkn dk.2L dkji dkxi (8.125) 

—ir /a 



X ^L(kl L )^(k 2L )^ L (k 3L )^ L (k4 L ) 
x 6(k 1L -f k 2 L + k 3z , + k4i). 


If we can write Eq. (8.124) in the same form as Eq (8.125), then we can obtain 
the new coupling constant, u L . We first compute the expectation value, ( V ), and 
obtain 


(V) = Da L e ~^ aL ^ j | Da L e (J-j 


4 d 


dkidk 2 dk^dk4 


x £,(k 2 )y /.(k^eS* 17 £,(k4)^(ki -+■ k 2 k 3 + k4) 


?-n /La 

fir /La 

fir / a 

fir /a 

dki 

dk 2 

d k 3 


J-ir/La 

f-ir/La 

Jir/La 

tir/La 


x 0L(k 3 )<7L(k4)<5(ki k 2 -t- k 3 -\- k4) 


+ u' 


ir/a 


ir /La 


Jki Jk 2 rfk 3 Jk4<r L (ki )<7 L (k 2 )c7 L (k 3 )c7L(k4) 


x <5(ki -j- k 2 -j- k 3 4- IC4) ^ . 
The factor 6 


(8.126) 


enters because there are six terms with two factors of £f L ( k) and 
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two factors of a L {k). They each give identical contributions. The last term can 
be neglected since it contains only short-wavelength contributions and 
contributes to the regular part of the free energy. To evaluate the second term, 
we use Eq. (7.27) to obtain 


" ^3 + kl)- 
K-i ~r T 


(8.127) 


Then the singular part of ( V ) takes the form 

\4d /» i>n/La 


(V), = u' 


2tt 


d ki • • • d k4 

—n/La 

x ^ L (ki)^ L (k 2 )^L(k3)^L(k 4 )<5(k 1 + k 2 + k 3 + 1^) 

3 d pir/La 


/ 1 \ f f 7r/Lfl 

+ 6 u'(— J (•■■I dk,dk 2 ^ L (k,)^L(k2) 

x<(k, + k2) C" k ^T7' 


(8.128) 


We see that (V) has a structure similar to the original Hamiltonian, but it yields 
no correction to the quartic part. The correction to the quartic part can be 
obtained from the second-order terms in the cumulant expansion. 

Many terms contribute to the second-order cumulant, \[(V 2 ) — (V) 2 ]. To 
obtain explicit expressions for them, we must use Wick’s theorem (see Ex. 4.9). 
We will not attempt to work out all the details here but will summarize the 
results. As Wilson and Kogut [8] have shown, the only term of interest in 
2 Vy 2 ) ~ (y) 2 ] is the correction to the quartic term in the effective Hamiltonian. 
They have shown that corrections to the quadratic terms can be neglected. 
Furthermore, contributions to the second-order cumulant coming from pairwise 
averages of spins which are both located on the same potential, V, give no 
contribution since for those terms ( V 2 ) = (V) 2 and they cancel out. Thus, we 
only retain terms which have at least one pairwise average between spins on 
different factors, V. The only term which involves four spins, Sf L ( k), and which 
contributes to the second-order cumulant is 


~(V\ 


2 ' ' /quartic 2 


l-l 


Da L e~^ o{{<TL}) 

n/La 




8 d 


{ r c n / La r r 7r / a _ ~ 

36m' 2 ' • • dkidk 2 dk 7 k8 • • • Jk 3 ^k4^k5^k6 

J J -n/La J Jn/La 


X ^L(ki)^(k2)<7L(k3)o’L(k4)<7L(k5)cr L (k6)^ L (k 7 )^ L^s) 


+ 


} 


(8.129) 
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The origin of the numerical factor, 36, can be understood if we note that there is 
a factor, in front of (V / 2 ) qua rtic, there are six terms in each factor V with two 
(f L 's and two ol ’s, and there are two different ways of taking pairwise 
averages for each term in V 2 {\ x 6 x 6 x 2 = 36). 

From the results of Exercise 4.9 and Section 7.C.1, we obtain 


Uv 2 ) c = 1^1 36 «' 2 


' quartic 


1\ 


5 d 


it/La 


—Tt/La 


d kj • • • d k4 


2irJ 

x ^ i (k,)y I .(k 2 )y ; .(k 3 )^t(k 4 ) 6(k, + k 2 + k 3 + k 4 ) 


'ir /a 
ir/La 


d k 


k 2 + r) \ (k 3 + k 4 - k) 2 + r 


-t- other terms. 


(8.130) 


If we combine contributions from Eqs. (8.128) and (8.130), the Hamiltonian 
fL,ui ) in Eq. (8.124) takes the following form: 


l}, r L , u L ) - - ( — 


i ( i \ d r /La 


—ix/La 


dk^ L (k)^ L (-k) 

12 M ' r /a 


l i2 , 12 m' r /a - i 

X l (k+r) + (^ L dk ^Tr 

/ 1 \ 4d r rl La 

+ fc) I 'L /t / ki ' <ik4 

X ^ L (k 1 )^ L (k 2 )y’ L (k3)^L(k 4 ) <5(k! + k 2 + k 3 + k4) 


X M 


+ 


36m /2 r /a 


(27 r) J 7T /La 


f ^(w~ 

J tt/La \k 4" r 


(k 3 -j- 1*4 — k) + r 


(8.131) 


where we have neglected some contributions (terms not containing block spins), 
some second-order quadratic terms, and terms containing six or eight block 
s Pms. Also, we neglected all contributions from higher-order cumulants. 

To obtain Eq (8.131) in a form similar to the original effective Hamiltonian, 
Wc must make a change of scale. We let 


k L = Lk 


and 


(8.132) 


^(kz.) = ;.(k). 


(8.133) 
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Then we find 


l({^ l}, r L, ml) 

1 Z 2 L~ d r /a 


2 (Itt) J — ?r/< 


J 7r/a 

dk L <? L (k L )<? L (-k L ) 

—it la 


i 2 w' r /a ~ i 


dk 


k\ 

L 2 + Y + {2ir) d J n/La " & + r 
1 \ 4 ^ Z 4 


+ hr- 


27ry (Z/*)' 


f 

Z 4 /• j*»/a 

(Z/*) 3 J Jwa' 


(8.134) 


dkiL x • • ■ x dk^ L 


X S? Likli)^ L{k2L,)& ? l(^4l) <5(klL + + k 3 ^ + IMl) 

# 36m /2 r /a 


(27 r) Jir/La 


f 

J ir/La \k 2 + 


+ »V Wk 3 + k 4 -k) / + r 


Equation (8.134) is the transformed Hamiltonian that we have been looking for. 
If we let Z = L l+d / 2 , then we obtain the following expression for rj,: 


fL 


r + 


12m 


;j; 


7r/a _ i 

dk 

(27T) d J IT /La k 2 + r 


(8.135) 


The expression for u L requires more discussion. To obtain an expression for u L 
which is independent of the k integrations in Eq. (8.134), we must make an 
approximation. If we let k 3 + k 4 = 0, we obtain 


jr 0 

u l — L 


(2TT) d J ir/La V ^ 2 + r ) 


(8.136) 


where = 4 — d. This approximation is made every time we change the scale 
of the Hamiltonian. Eqs. (8.135) and (8.136) are the recursion relations we have 
been looking for. They are correct as long as \S j 1 (this will become clearer 
below). 

To find the fixed points of Eqs. (8.135) and (8.136), we will turn them into 
differential equations. We first write the recursion relation for a transformation 
from block L to block sL, 


r S L = s 


12 u L 

rL + 7^Td 
(2?r) 


r*/« _ 1 

dk 


■«,sa k 2 + r L 


and 


MjL — & 


UL 


36m? 


ir/a 

dk 

n/sa 


k 2 + r L 


(8.137) 


(8.138) 
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where s is now considered to be a continuous parameter. Let s = 1 + h for 
/j < 1. Then, Eqs. (8.137) and (8.138) can be approximated by 

12u ( 1 \ 

r (\+h)L = ( l + 2h ) r L + -—h \ ^ k \l2 Vr ) (8.139) 

(27t) J(n/a)(l-h) \* + r h) 

and 

UL ~~^TZd d\k 2 4-r ) ' (8-140) 

(27t) J (*/a)(l-A) V* + 

We next evaluate the integrals. We can write for small h 


i r /a 


[2ir) d J (n/a){\-h) W + nJ (2*? [Qf+nJ J (n/a){\-h) 


i \ r /a 


1 Ah 
n 


where A is a constant. Thus, Eqs. (8.139) and (8.140) become (to first order in 


'■(n-t)i - n. = 2hr L + - !~r ~ 
((a) ^ 


(8.141) 


u (l+h)L ~ U L ~ 


?>u 2 L Ah 


0 we next divide by hL and take the limit h — * 0, we find 


r dr L 

l n - 2r * + 


(©■«) 


(8.142) 


(8.143) 


aL 


« 3 '« 


(8.144) 
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Now let t = In L and choose the units of a such that 7 x/a = 1. Then 


dr „ Am 

— = 2 r + 

dt 1 + r 


(8.145) 


and 


du 3 Am 2 

— = Su =■ . 

dt (1 + r) 2 


(8.146) 


Higher-order terms can be neglected as long as \S\ -C 1. The fixed points of 
Eqs. (8.145) and (8.146) occur when 


and 


dr* Au* 

~dt =2r+ T+7* 


= 0 


(8.147) 



3Au* 2 
(1 +r*) 2 


(8.148) 


There are two sets of fixed points, ( u * = 0, r* = 0) and (w* = S/3 A, 
r* = —S/6). If we let 6u = u — u* and 6r = r — r*, then the linearized 
transformation becomes (to first order in u* and r*) 

/ dbr 
dt 
d6u 

\~~df 


_ (2 -Au* A(1 -r*) \ (6r 
0 S’ — 6 Am* / \^m 


(8.149) 


and we obtain the eigenvalues X r = 2 — Au* and \ u = S — 6Au* for small u* 
and r*. Thus, X r is a relevant eigenvalue. For the fixed point ( u * = 0 and r* = 0) 
the eigenvalues are A r = 2 and A u — S (this is just the Gaussian fixed point), 
and for the fixed point (m* = S /3A and r* — —S/6) the eigenvalues are 
X r = 2 — S/3 and A M = — S. For d > 4, S < 0 and the second fixed point is 
repulsive and therefore unphysical since both eigencurves are directed away 
from the fixed point and it can never be reached. Thus, for d > 4, the physics is 
governed by the Gaussian fixed point. For d < 4, the Gaussian fixed point is 
repulsive and for this case the fixed point ( u * = <f/3A,r* = —S/6) is the 
physical one. For d = 4, the fixed points coalesce. In Fig. 8.4 we sketch the 
trajectories in the neighborhood of the two fixed points. 

We can now obtain expressions for the critical exponents a and u, but it 
requires a bit of algebra. To obtain the equations for the eigencurves of the S 
model, we must rewrite the differential equations in Eq. (8.149) in terms of L. 
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(a) ( b ) 

Fig. 8 . 4 . A sketch of trajectories in the neighborhood of the physical fixed points for the 
S 4 model: (a) For d > 4 the Gaussian fixed point governs the physics; (b) for d < 4 the 
fixed point ( u * = e/3 A, r* — -e/6) governs the physics (e = 4 — d). 



We first write Eq. (8.149) in terms of its eigenvectors and eigenvalues. (Note 
that the left and right eigenvectors will be different since the matrix on the 
right-hand side is not symmetric (cf. Section 5.C). For our purposes, it is not 
necessary to find explicit expressions for the eigenvectors. If we let 8u\{t) 
denote the relevant eigenvector, we can write 

^lW = (2-Ak*)«h,W- (8150) 

at 

The solution to Eq. (8.150) can be written 

6ui(t) = e {2 ~ Au ^6ui( 0). (8.151) 

U we now remember that t = InL, we find 

8u^ L = e {2 - Aut)lnL 8ui (8.152) 

so that A! =exp[(2-Aw*) In L\. We then make use of Eq. (9.E.18) to 
obtain 


P = 


2 - Au A 


d 


(8.153) 
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Using Eqs. (8.50) and (8.153) we obtain 


a 


= 2 - 


d 

2 -Am* 


and using Eqs. (8.77) and (8.154) we obtain 

1 

v = . 

2 — Am* 


(8.154) 


(8.155) 


Since the fixed points which characterize the system are different for d > 4 and 
d < 4, the critical exponents will also differ for those two cases. 

Let us first consider the case d > 4. Then u* = 0 and we find 


a 



S 
2 ’ 


where we have used the fact that S = 4 — d. We also find 


(8.156) 


1 

v = 2- 

When d < 4, n* = S/3 A, and we find 

a- 6— S ~ 6 

and 

1 S 


(8.157) 


(8.158) 


(8.159) 


The magnetic critical exponent can also be computed and the result is 

£«3 + S (8.160) 

as for the case d > 4. The other exponents can be obtained from the identities in 
Eqs. (8.51), (8.52), and (8.78). In Table 8.1 we compare the results of the S A 
model with the experimental results, with mean field results, and with the 
results due to exact calculations for the Ising model. The first thing to note is 
that, for d — 4, the mean field theories and the S 4 model give the same results. 
For d = 3, the S 4 model gives very good agreement with the experimental and 
Ising values of the exponents (the exact results for the three-dimensional Ising 
are obtained from a numerical calculation). However, there is really no reason 
why it should. We have retained only the lowest-order approximation for the 
exponents in the S expansion. If we take S = 1 (as it is for d = 3), then the 
expansion need not converge. In fact, higher order terms in the series give large 
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contributions and ruin the agreement. For this reason the $ expansion is thought 
to be an asymptotic expansion — that is, one for which the first few terms give a 
result close to the result obtained by summing the entire series. 

One may well ask the meaning of an expansion about d = 4 and in terms of 
noninteger d, when real physical systems correspond to d = 1, 2, or 3. However 
consideration of these “unphysical” values of d provides us with a theoretical 
tool for understanding the role of the dimension of space in physical 
phenomena, and it has led to a major advance in our understanding of critical 
phenomena. 


► S8.B.EXACT SOLUTION OF THE TWO-DIMENSIONAL 
ISING MODEL [14, 15] 

The two-dimensional Ising model is one of the simplest systems that exhibit a 
phase transition and one of the few that can be solved exactly. The phase 
transition in this system is an order-disorder transition. It has served as the 
paradigm system for describing order-disorder transitions in a variety of 
contexts. In physics it has been used to model the phase transition from the 
paramagnetic to the ferromagnetic state (the Curie point) in magnetic crystals. 
However, it has also had many applications outside of physics. It has been used 
to model learning [16] and information storage [17] in neural networks. In 
molecular biology, it as been used to model cooperative conformational 
changes due to ligand binding in macromolecules [18] and heat denaturation of 
DNA [19]. It has also been used in sociology to model cultural isolation [20]. In 
this section we shall use it in its physics context to model order-disorder in a 
spin system. 

An analytic expression for the partition function of an Ising system can be 
obtained in one and two dimensions, but no one has succeeded in solving it 
analytically in three dimensions. In one dimension it does not have a phase 
transition. In two dimensions, it does have a phase transition. In Section 7.F we 
considered the one-dimensional Ising model and the mean field approximation 
to the Ising model. In this section we will consider the case of a planar Ising 
spin lattice and show how an analytic expression for the partition function may 
be obtained and how the phase transition appears [21]. 


► S8.B.1. Partition Function 

Let us consider a planar lattice of N spin— objects with magnetic moment, fi. A 
planar lattice is one which can be laid out flat in a two-dimensional plane 
without any bonds intersecting. For example, a lattice with periodic boundary 
conditions is not planar. Each lattice site can interact with the magnetic fields 
of its nearest neighbors. Some examples of planar lattices are shown in 
Fig. 8.5. 
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Fig. 8.5. Three types of planar lattice, (a) Square lattice, (b) Triangular lattice, (c) 
Hexagonal lattice. 

The Hamiltonian for a two-dimensional planar lattice can be written 

H = 1), (8.161) 

{« 

where £/,-,•} denotes the sum over nearest neighbors, i and j, J is the magnetic 
interaction energy between nearest-neighbor pairs, and s, is the z component of 
spin at the zth lattice site. The factor, —1, in Eq. (8.161) merely shifts the zero 
point of energy and will prove useful in subsequent calculations. For spin— f 
objects, Si = -f 1(— 1) if the spin of site i is oriented in the positive (negative) z 
direction. If J > 0, the lattice will have its lowest energy when all the lattice 
sites have spin-oriented in like manner. If J < 0, the configuration in which 
neighboring spins are oriented opposite to one another will be favored. 

The partition function for this planar lattice can be written 

z,v(r) = £>xp ( - 1) ] 

ac - \(0) ) (8.162) 

— e -N m K ^ . .. ^ 

si=±l s N =± 1 ( ij ) 

where K = fiJ , E ac . denotes the sum over all 2 N possible different 
configurations of spin on the lattice, and N nn is the number of nearest neighbor 
pairs. The sum, and the product, are taken only over nearest-neighbor 
pairs. Let us now note that since j, = ±1, we can write 

e Ks ‘ Sj = cosh(£) + SiSjSiiih(K) = cosh(AT)[l + tanh(i5T)j,-j,-]. (8.163) 

Therefore, the partition function can be written in the form 

Z N (T) = e~ NnnK (cosh(K)) Nnn " + tanh(^)^]. (8.164) 

Si=±l S N =±1 (ij) 
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To evaluate the partition function, we must expand the product and then sum 
over each spin variable, 57. There are several properties of the spin that should 
be noticed and that simplify the expression for the partition function. First note 
that 


sf 1 = 2 and ^ j ( 2w+1 = 0, (8.165) 

i=± 1 i=± 1 

where n is an integer (including zero). Thus, only even powers of the spin 
variable can appear in the partition function. Also, only nearest-neighbor pairs 
can appear and a given nearest-neighbor pair can appear only once. Each 
summation over a spin variable yields a factor of 2. Expansion of the product in 
Eq. (8.164) yields a sum of terms depending on varying powers of tanh(A). 
After summation over spins is performed, each term in the sum can be 
represented by a picture which we call a closed graph. We then can write the 
partition function in the form 

Z n {T) = 2 N e~ NnnK (cosh. (K)) Nm ]T^(all closed graphs) (8.166) 

The closed graphs are obtained as follows: 

(1) Lay out the grid of lattice sites as they appear on the original planer 
lattice (cf. Fig. 8.5) but do not yet draw connections between lattice 
sites. 

(2) Find all different ways to draw nearest-neighbor bonds between lattice 
sites so they form closed loops (any number of closed loops including 
zero). We only require that no nearest-neighbor bond appears more than 
once. There may be more than one closed loop on a graph, and they may 
intersect, as long as they have no bonds in common (cf. Fig. 8.6) for an 
example of an allowed and forbidden graph). 



Fig. 8.6. Some closed graphs on a 3 x 3 square lattice, (a) An allowed graph, (b) A 
forbidden graph because two closed loops share a bond. 
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(3) A given graph contributes a factor, tanh r (X), where r is the number of 
bonds in the graph. 

An example of the use of these rules is given in Exercise 8.2 

In writing Eq. (8.166), we have made progress toward obtaining an 
analytic expression for the partition function, but we still have a ways to go. 
In the next section, we will change the subject slightly and show how to 
represent an antisymmetric matrix in terms of a directed graph. We will 
also show how to represent the determinant of an antisymmetric matrix 
(actually its square root) in terms of dimer graphs. Then in Section S8.B.3 
we will make a connection between the closed graphs described above and 
dimer graphs. 


j 

■ EXERCISE 8.2 Consider the 3x3 spin lattice shown in the figure. | 
Assign a coupling constant, J x {J y ) to horizontal (vertical) bonds. Obtain an 
analytic expression for the partition function. 



Answer: First note that the lattice has N = 9 lattice sites and N nn 
nearest-neighbor pairs. The partition function can be written 


= Q 6Ky gtC x S\S2 gKyS\S^ ^K X S2S3 ^KyS2Ss gKySjS^ 

5] — i 1 i?9 = i 1 

X gK x S4S$ gKjrSsSs gKyS$s% gKyS(,s$ gK x sis% gK x s%si$ 


= 12 

( 1 ) 


where K x = (3J X and K y = (3J y . We may write this as 

Zg = 2 9 e~ 6Kx e~ 6Ky cosh 6 (K x )cosh 6 (K y ) ^(all closed graphs). (2) 
Next we note that 
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► S8.B.2. Antisymmetric Matrices and Dimer Graphs [22] 

Let us now return to the planar lattice whose partition function we wish to 
compute. We again draw a picture of the planar lattice, but now associate a 
direction to each bond. The resulting picture we call a directed graph. We can 
associate an antisymmetric matrix, A with a directed graph by assigning to the 
directed bond between lattice sites, i and j, a matrix element, ay. Only nearest- 
neighbor lattice sites give rise to nonzero matrix elements. We can associate the 
following matrix elements with the directed graph: 


a 

for bond directed i - 


—a 

for bond directed i <- 

~j, 

= 0 

if i and j have no bond 


(8.167) 


Here a = tanh(£). It is possible to generalize Eq. (8.167) and assign different 
weights, a, to different bonds. For the case of a square directed graph, for 
example, for the horizontal bonds we let a = x = tanh(£*) and for the vertical 
bonds we let a = y = tanh (£■>,) (such a case is considered in Exercise 8.3) 

If we want all the terms in the determinant of the antisymmetric matrix, A, to 
be positive, then the choice of directions given the nearest-neighbor bonds must 
be carefully made. Kasteleyn [22] showed that for graphs with an even number 
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triangular lattice, (c) Directed hexagonal lattice 


of lattice sites, directions can be chosen in such a way that all terms in the 
determinant of the antisymmetric matrix, A, are positive. This is important when 
we later connect the antisymmetric matrix generated by a directed graph to the 
partition function. The direction is chosen so that in going around any even loop 
on the directed graph in a clockwise manner, one encounters only an odd 
number of clockwise directed bonds. An even loop is a closed path containing 
an even number of lattice sites and enclosing an even number (including 
zero) of lattice sites. An even loop never passes through the same lattice site 
twice. Correct directions for some directed graphs are shown in Fig. 8.7. 
An example an antisymmetric graph obtained in this manner is given in 
Exercise 8.3. 


, ■ EXERCISE 8.3. Consider the directed planar lattice shown in the 
i accompanying figure. Write the 12 x 12 antisymmetric matrix, A, associated 
with this lattice, and compute the determinant of the matrix, A. 
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Answer: The antisymmetric matrix associated with the accompanying graph 
is 


/ 0 

JC 

0 

y 

0 

0 

0 

0 

0 

0 

0 

°\ 

— JC 

0 

JC 

0 

—y 

0 

0 

0 

0 

0 

0 

0 

0 

— JC 

0 

0 

0 

y 

0 

0 

0 

0 

0 

0 

—y 

0 

0 

0 

JC 

0 

y 

0 

0 

0 

0 

0 

0 

y 

0 

— JC 

0 

JC 

0 

-y 

0 

0 

0 

0 

0 

0 

—y 

0 

— JC 

0 

0 

0 

y 

0 

0 

0 

0 

0 

0 

—y 

0 

0 

0 

JC 

0 

y 

0 

0 

0 

0 

0 

0 

y 

0 

—x 

0 

X 

0 

-y 

0 

0 

0 

0 

0 

0 

-y 

0 

— JC 

0 

0 

0 

y 

0 

0 

0 

0 

0 

0 

-y 

0 

0 

0 

JC 

0 

0 

0 

0 

0 

0 

0 

0 

y 

0 

— JC 

0 

JC 

V o 

0 

0 

0 

0 

0 

0 

0 

-y 

0 

— JC 

0/ 


where x = tanh(A*) and y = tanh(AT >> ). The determinant of this matrix is 

det(A) = 16jc 8 y 4 + 48jc 6 y 6 + 44jc 4 y 8 + \2x 1 y™ +y 12 

= (4*y + 6*y + y 6 ) 2 . ^ 


The determinant, det(A), of an antisymmetric matrix, A, can always be 
written as the square of a polynomial containing the weighting factors, a (or 
whatever weighting factors are assigned to the graph). Furthermore, it is 
possible to express y / det(A) (A obtained from a directed graph) in terms of a 
sum of graphs which we shall call dimer graphs. A dimer graph is obtained in 
the following way: 

(1) Lay out the grid of lattice sites (but do not yet draw bonds) as they 
appear in the original directed graph. 

(2) Connect all pairs of lattice sites by bonds so that each lattice site is 
attached to only one bond and all lattice sites are members of a bond. 
The lattice consists of a set of unconnected bonds (we call them dimer 
bonds). 

The quantity, >/det(A), can be represented by the set of all different dimer 
graphs which may be obtained from the original planar lattice. Thus, 


\/det (A) = ^(all different dimer graphs). 


(8.168) 
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Once a dimer graph is drawn, an analytic expression can be associated with it. 
If a graph has s dimer bonds, then we can assign a factor, tanh^A'), to the 
graph. As we show in Exercise 8.4, we can also generalize this to give 
different weights to different bonds. For example, let us consider a square 
dimer graph with different bond strengths, K x , for horizontal and, K y , verticle 
bonds. If the graph has r horizontal dimer bonds and s verticle dimer bonds, we 
associate a factor, tanh r (X A; )tanh s (X ) ,) with t he grap h. In Exercise 8.4, we 
construct the dimer graphs that represent \J det( A) for the antisymmetric 
matrix, A, obtained in Exercise 8.3. 


I EXERCISE 8.4. Consider the directed lattice shown in Exercise 8.3 and 
i the antisymmetric matrix, A, ge nerated by that directed lattice. Draw all 
! dimer graphs that represent yj det(A) and show that they do indeed 
■ reproduce the analytic expression obtained in Exercise 8.3. 

; Answer: For the an tisymm etric matrix, A, generated in Exercise 8.3, we 
want to show that ^/det(A) = ^ [all dimer graphs]. Note that 

| ^ [all dimer graphs] = 


i I + ~ i + i 

V *V *V zV *V zV *v 

( 2 ) 

Thus, if we add the contributions from the various graphs, we find 
y^[all dimer graph] = 4 x 4 y 2 + 6y^y A + y 6 , 
in agreement with Exercise 8.3. 



Ill III III “I 
in + "i + ir: + iii 

y 6 x 2 y A x 2 y A x 2 y A 


► S8.B.3. Closed Graphs and Mixed Dimer Graphs 

Kasteleyn [22] and Fisher [23] have found a way to write the partition function 
in terms of dimer graphs and thereby have found a way to obtain an expression 
for the partition function in terms of the determinant of an antisymmetric 
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matrix. The method is based on the following observation. Consider a lattice 
with two different coupling constants, J and J *, for example. A bond with 
coupling constant, /*, will contribute factor, 2e~**cosh (K*) and tanh(AT*), 
where K* = (3J*, to the partition function. [Note that the factor —1 in Eq. 
(8.161)] gives rise to this property.] However, note that 


lim 2e ^’coshfK*) = 1 and lim tanh(^T*) = 1. (8.169) 

K*—*oc K *~ »oo 


The next thing to notice is that the partition function of a given planar lattice 
depends on a sum of terms depending on different powers of the weighting 
factor, tanh {K). However, the dimer graphs which are generated from a directed 
graph all depend on the same power of the weighting factor, tanh(A'). Therefore 
we need to construct a directed graph which can generate dimer graphs 
depending on different powers of the weighting factors. The limiting behavior 
in Eq. (8.169) enables us to do this in a unique and unambiguous way for planar 
graphs. 

The expansion of the partition function in terms of dimer graphs requires two 
important pieces of information. On the one hand, we use the theorem of 
Kasteleyn which says that it is possible to draw a directed graph which 
generates an antisymmetric matrix whose determinant has all positive terms. On 
the other hand, we use a method of expanding the vertices of a planar lattice so 
that a directed graph can be associated with it (we call it a mixed directed 
graph ) which produces dimer graphs (we call them mixed dimer graphs ) whose 
analytic expressions are in one-to-one correspondence with the closed graphs 
for that planar lattice. Kasteleyn [21] and Fisher [22] had different ways to 
accomplish this. In this section, we use Fisher’s approach [23]. 

We can construct mixed directed graphs according to the following 
procedure. 


(1) Draw the planar lattice of interest. (Figure 8.5 gives some examples of 
planar lattices.) The planar lattice will have N vertices and N nn nearest- 
neighbor bonds. On the planar lattice we shall call the bonds primary 
bonds. Represent primary bonds by solid lines. 

(2) Replace each vertex of degree q(q > 4) by a cluster of q — 2 vertices 
connected by q — 3 internal bonds. (A vertex of degree, q, is one at 
which q bonds meet.) Represent internal bonds by dashed lines. (See 
Fig. 8.8 for an example of this replacement.) With this replacement, all 
vertices are now either of degree q = 2 or 3. 

(3) Replace each vertex of degree, q = 2, by two vertices connected by 
one internal bond. Replace each vertex of degree, q = 3, by a cluster of 
three vertices connected by three internal bonds. (See Fig. 8.9 for 
examples.) 
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Fig. 8.8. Replace vertices of degree, q(q^4), by a cluster of q - 2 vertices connected 
by q - 3 internal bonds, (a) q = 4. (b) q = 5. 
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Fig. 8.9. (a) Replace vertex of degree 2 with two vertices and an internal bond, (b)-(e) 
Replace vertex of degree 3 with three vertices and three internal bonds. 


(4) Assign directions to all the bonds (primary and internal) so that 
counterclockwise circulation around any even loop yields an odd number 
of clockwise directed bonds. (According to the theorem of Kasteleyn, 
this can always be done.) 

The mixed directed graph which results from steps 1-4 is a collection of 
directed solid and dashed lines connecting vertices of degree 3 or smaller. 

Once a mixed directed graph has been associated with a planar lattice, we 
can associate an antisymmetric matrix, A m , with the directed mixed graph in the 
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following manner. The antisymmetric matrix, A m , is constructed by assigning to 
matrix elements, a,y, the following values: 


#27 S 


f !/<* 

— 1/a 

1 

-1 

0 


for a solid bond directed i — * j, 
for a solid bond directed i <— j, 
for dashed bond directed i —> j , 
for a dashed bond directed i *— j, 
if i and j have no bond. 


(8.170) 


As we found in Section S8.B.2, we can associate a collection of dimer 
graphs (we shall call them mixed dimer graphs ) with each directed mixed 
graph. Dimers can now be solid or dashed depending on whether they occur 
at the position of primary or internal bonds, respectively, on the mixed 
directed graph which generated them. No more than one dimer bond attaches 
to any given vertex. To each dimer bond we assign a factor 1/a, if the 
dimer bond is a solid line and assign a factor 1 if the dimer bond is a 
dashed line. A given dimer graph has a factor (l/a) m (l)” assocated with 
it if it has m primary (solid) dimer bonds and n internal (dashed) dimer 
bonds. 

We can now relate ^/det(A lll ) to the sum over all mixed dimer graphs that 
can be derived from the mixed directed graph which generates the 
antisymmetric matrix, A m . We write 

y / det(A m ) = y^(all mixed dimer graphs). (8.171) 


The mixed dimer graphs that result from a given planar lattice are in one-to-one 
correspondence with the closed graphs which are generated by that same planar 
lattice. The correspondence is “inverted,” however. The primary (solid) dimer 
bonds on the mixed dimer graph correspond to the lack of bonds on the closed 
graph. 

We can now relate y / det(A m ) to the partition function associated with the 
planar lattice. We can write 

Z n {T) = 2 Ar e"^^(cosh(/s:)) ;Vw, aVdet(A ffJ ) 

_ 2 N e~ NnnK {cosh {K)) Nm a s ^P(all mixed dimer graphs), (8.172) 


where s is the maximum number of primary (solid) dimer bonds that can appear 
on any of the mixed dimer graphs. With this definition, the dimer graph in 
which all dimers are primary (solid) bonds actually corresponds to the unit 
graph. In Exercise 8.5 we show these steps for the planar lattice considered in 
Exercise 8.2. 
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r 


0 EXERCISE 8.5. Consider the planar lattice shown 
in the accompanying figure, (a) Construct a mixed 
directed graph for this lattice and construct the 
associated antisymmetric matrix, A m . (b) Compute 
the partition function from this antisymmetric matrix, 

A m . (c) Draw the mixed dimer graphs for this lattice 
and write an analytic expression for the partition 
function. 

Answer: 

(a) We first obtain the directed mixed lattice. We replace the vertex of 
degree 4 by two vertices of degree 3 and an internal line. We then 
replace each vertex of degree 2 by two vertices of degree 2 connected 
by an internal line. We replace each vertex of degree 3 by two vertices 
of degree 3 connected by three internal bonds. The results is a graph 
with 26 vertices. 

This procedure is shown in the graphs below. 



We next give directions to all the lines in the mixed graph so that 
for any even loop , clockwise circulation encounters an odd number of 
clockwise directed bonds. Such a choice is shown below. We have 
also numbered the vertices. 
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(b) Given the numbered mixed directed graph above, we can use it to 
construct an antisymmetric matrix, A m . Choose the matrix elements 
according to the directions of the bonds and their type. Assign a value, 
rtl/jc, to horizontal bonds and assign a factor, to vertical bonds. 
The nonzero matrix elements are a 1,2 = +1, ^ 2,1 = — 1, < 22,3 = H-1/jc, 

03.2 = — 1/*> 03,5 = +1» 05,3 = — 1, 05,12 = +1 /y, 012,5 = — 1 /y, 

01.3 =03,1 = 0 , and so on. The matrix, Ai is too big to write here, 
but it is easy to construct given the directed graph above. Once it has 
been constructed we can find its determinant. It is 

det(A t ) =-^(1 +4^;y 2 + 2*y +2*V + 7*V) 2 (1) 

x y 

The partition function is 

Z n {T) = 2 9 e- 6K *e- 6K ycosh 6 (K x )cosh 6 (K y )x 6 y 6 

x (1 + 4x 2 y 2 + 2*V + 2x 2 y 4 + 7 x 4 y 4 ) 

= 2 9 e~ 6Kx e~ 6Ky cosh 6 (K x )cosh 6 (K y ) 

x (1 + 4x 2 y 2 + 2 x 4 y 2 + 2x 2 y 4 + 7 x 4 y 4 ). (2) 

Let us now draw the dimer graphs. There is one dimer graph for each 
closed graph in Eq. (3) of Exercise 8.2. We shall draw some of them 
here. We have 


(«) 


(l/x 6 )(l/y 6 ) ‘ (l/x«)(l/y 4 ) (l/* 2 )0/y 4 ) 


i 


(iv) 


(v) 


(l/x 4 )(l/y 2 ) (l/* 2 )(l/y 2 ) 



(l/* 2 )(l/y 2 ) (l/x 2 )(l/y 2 ) 



exact solution of the two-dimensional ising model 


475 


The analytic expression associated with each mixed dimer graph is 
shown below it. Graph (i) above corresponds to the first closed graph 
in Exercise 8.2, Eq. (3). Four mixed dimer graphs of the type (ii) 
above correspond to closed graphs, 2-5 in Exercise 8.2, Eq. (3). The 
two mixed dimer graphs of the type (iii) above correspond to closed 
graphs 6 and 7, and the two mixed dimer graphs of the type (iv) above 
correspond to closed graphs 8 and 9 in Exercise 8.2, Eq. (3). Two 
mixed dimer graphs of the type (v) above correspond to closed graphs 
10 and 11 in Exercise 8.2, Eq. (3). Four mixed dimer graphs of the 
type (vi) above correspond to closed graphs 12-15 in Exercise 8.2, 
Eq. (3). And finally, graph (vii) corresponds to the last graph in 
Exercise 8.2, Eq. (3). Thus there are 16 possible mixed dimer graphs. 
They each consist of 26 vertices and a mixture of solid and dashed 
dimer bonds. The partition function is 

Z N (t) = 2 9 e~ Kx e~ 6Ky costf {K x )cos\\ 6 (K y ) 

x Jt 6 y 6 ^P(all mixed dimer graphs). (3) 


► S8.B.4. Partition Function for the Infinite Planar Lattice 

We now have enough information to obtain an analytic expression for the 
partition function of an infinite planar lattice. For simplicity we will consider a 
square planar lattice. The procedure is to extract an n x n segment from the 
infinite lattice, and then take the limit, n — > oo. In order to demonstrate the 
procedure, we will first extract a 3 x 3 segment from the infinite planer lattice 



Fi & 8.10. A 3 x 3 segment of an infinite square directed graph corresponds to a 
^4 x 54 mixed directed graph. 
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and describe its properties in detail. Then we will go on to write partition 
function for an n x n segment and take the limit, n — * oo. 


► S8.B.4. 1. 43x3 Segment of the Infinite Square Planar Lattice 

Let us consider a 3 x 3 segment of an infinite square planar lattice. We first 
construct a 54 x 54 mixed directed graph as shown in Fig 8.10 using the rules 
of Section S 8 .B. 3 . We can then assign an antisymmetric matrix, A54, to the 


directed 

mixed graph. We obtain 
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( 8 . 173 ) 


where “ 0 ” in the matrix ( 8 . 173 ) represents 
zero: 


r = r 6 = 


xeex 6 = 


y = y 6 = 


/ 0 1 1 

-1 0 1 

-1 -1 0 

00-1 
000 
\ 0 0 0 

/ 0 000 
0000 
0000 
0000 

l/x 0 0 0 

\ 0 000 

/o 0 0 0 
0000 
0000 
0000 
0000 

\0 l/y 0 0 


a 6 x 6 matrix with all elements 


0 0 0\ 

0 0 0 

1 0 0 

0 1 1 

-1 0 1 

-1 -1 0 / 

0 0\ 

0 0 
0 0 
0 0 ’ 

0 0 

0 0 / 

0 0\ 

0 0 
0 0 
0 0 ’ 

0 0 
0 0 / 


(8.174) 
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Fig. 8.11. The 54 x 54 directed lattice with bonds connecting opposite sides added. 


The subscript 6 is added to show that T 6 , X&, and F 6 are 6 x 6 matrices. This 
notation will be useful later. In constructing the mixed directed graph, Fig. 8.10, 
we have left out its connections to the rest of the infinite mixed directed graph, 
and the antisymmetric matrix, A 54 , does not include these bonds. 

Let us now take our 54 x 54 directed lattice and change it by giving it 
periodic boundary conditions. We will connect opposite sides to one another by 
adding bonds and then add the contributions from these new bonds to the 
antisymmetric matrix. It turns out that as we consider larger and larger 
segments of the infinite planar lattice, these additional terms contribute less and 
less to the determinant of the antisymmetric matrix, and in the limit of an 
infinite lattice they do not affect the result. In Fig. 8.1 1 we redraw the 54 x 54 
directed lattice so that bonds on opposite sides are connected. We then no 
longer have a planar lattice. Note the directions of these added bonds are 
opposite to normal connections to the infinite lattice. The antisymmetric matrix, 
A 54 , that results from adding these new bonds to A 54 is given by 


( r 

X 

X 7 

— Y t 
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-Y 

0 

0 N \ 

-X 7 

r 

X 

0 

— Y T 

0 

0 

-F 
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— Y T 
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( 8 . 175 ) 
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The antisymmetric matrix, A 54 , is now in a form that is easy to deal with. 
First note that we can write A 54 as the tensor product of smaller matrices (tensor 
products are defined in the subsection below). We can write 

/ D 1S -h®Yl -h®Y 6 \ 

A 54 = j h®Y& Dig —I,®Y\ 

\h®Y T 6 h®Y 6 D, 8 / 

= h ® Ois + £3 ® (h ® Yl) - £ 3 _1 ® (h ® ft), (8.176) 



■ Tensor Product of Matrices. We introduce the tensor product of two 
matrices, A and B, as follows. We write C = A 0 B, where C is a fourth- 
order tensor whose (ijkl) th element is given by 

(fyijU = (A ® ^)y/W ~ AijBkl- (1) 

where Ay and 2?*/ are the (y)th and (kl) th elements of matrices A and 
respectively. The product of two fourth-order tensors, Q and 05, is given by 

(Q05)^y E Q imkn ^mjnl > (2) 


where and M m j n i are the (imfcn)th and (mjnl ) th elements of the tensors 
(Q> and 05, respectively. The tensor product of a matrix A with the unit matrix 
7 is 


(A 0 /)y-;y — AijSkj, (3) 

where 6k, i is the Kronecker delta function. IfB and C commute, then we can 
write 


(A 0 B)(C 0 D) = (AC) 0 (BD). 


(4) 
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The matrix Dig can also be written as a tensor product: 

Dis = h 0> Tg — £3 0 X $ + £3 1 0 xj . ( 8 . 179 ) 

Therefore, the antisymmetric matrix (A) 54 takes the form 

A54 = h ® (h ® r 6 ) -/ 3 0 (£3 ® x 6 ) + h <8> (E 3 -1 ® X %) + £3 0 (/ 3 0 Yl ) 
-£ 3 - 1 0(/ 3 0F 6 ), ( 8 . 180 ) 

where we have used the fact that E\ — £j*. Let us next introduce the matrix £3, 
which diagonalizes £3. That is, 

/A, 0 0\ 

Pj'E 3 P3 = 0 A 2 0 , ( 8 . 181 ) 

\ 0 0 X}J 

where A* is the kth eigenvalue (k = 1, 2, 3) of the matrix £3 and is given by 
X k = exp(27ri&/3). Then we find that 

A ' 4 = (P 3- 1 ® (P 3 - 1 ®/ 6 ))A 54 (/>3 ® (P 3 ®/ 6 )) 

= /3 ® h ® r 6 - / 3 ® (Pj'EiPy) 0 X 6 

+ /3®(P3 1 £ 3 “'P3)®^ 

+ (Pj'EsPs) ®/ 3 ® yj - (PJ'Ej'Ps) ®/ 3 ® y 6 . ( 8 . 182 ) 

We next note, for example, that 

/ r 6 0 0 \ 

/ 3 ®r 6 = o r 6 0 (8.183) 

\o 0 r 6 / 

and 

/a,x 6 0 0 \ 

(P3 1 £ 3 P 3 )0X 6 = 0 A 2 X 6 0 . ( 8 . 184 ) 

\ 0 0 a 3 x 6 / 

Then one can readily show we have obtained an expression for A ' 54 which is 
block diagonal. It can be written 

D { 1 1 oooooooo\ 

0 D^l 0 0 0 0 0 0 0 

0 0 D { ?1 0 0 0 0 0 0 

000 Dg 00000 
0000 £>g 0000, ( 8 . 185 ) 

00000 Dg} 000 
0 0 0 0 0 0 D® 0 0 

0000000 D^Jo 
0 0 0 0 0 0 0 0 D ^ l ) 
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where 


of] = r 6 - x,x 6 + {*!+ Wl -jY 6 


( 0 

i 

1 

0 

x/X i 

°\ 

-1 

0 

1 

0 

0 

y\j 

-1 

-1 

0 

1 

0 

0 

0 

0 

-1 

0 

1 

1 

—x\ i 

0 

0 

-1 

0 

1 

V o 

-y/Xj 

0 

-1 

-1 

0 / 


( 8 . 186 ) 


O') 

The determinant of D)J is given by 

**®-‘ + (;)’ + G)’ 



<;M”) 



(¥)-e)‘e 


cos 


/27TI 

\T 


( 8 . 187 ) 


Let us now note that due to the cyclic permeability of matrices under the 
determinant and the fact the determinant of a product of matrices is equal to the 
product of determinants of those matrices, we obtain 


i = 1 7=1 


-2|-|cos(^l 



( 8 . 188 ) 


These results are easily generalized to an n x n segment of the infinite planar 
lattice. 


► S8.B.4.2. An n x n Segment of an Infinite Square Planar Lattice 

Let us now consider an n x n segment of the infinite square spin lattice. The 
mixed directed lattice that we construct will have 6 n 2 lattice sites, and therefore 
the antisymmetric matrix that we associate with the directed lattice will be 
6n 2 x 6n 2 -dimensional. We proceed exactly as before. We introduce the n x n- 
dimensional matrix, E n , which consists of all zeros except for (i) matrix 
elements — 1 along the nearest upper off-diagonal and (ii) the matrix element 1 
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in the bottom left comer. We shall restrict outselves of odd values of n. Then the 
eigenvalues of E n are given by A* = exp(2irik/n), where k — 1,2, ... ,n. The 
determinant of the antisymmetric matrix, A 6n i , obtained by connecting opposite 
bonds of the 6n 2 x 6n 2 -dimensional directed lattice will be given by 



=rm<. (8 - i89) 

j= 1 k= 1 


The partition function can be written 
Z,,(T) = 

X x"("- i y ( "'i)y'det[A & ,!]. (8.190) 


If we remember that x = tanh(^) and y = tanh(£y) and note that 
e -A: sinh(^) =tanh(A')/[l -ftanh(A')], then with the help of Eq. (8.189) we 
can write the partition function as 


Zn{T) = 2”' 




nn< 


y=l 


k= 1 


(8.191) 


The logarithm of the partition function becomes 

ln[ Zn(T)] = n 2 ln(2) + n(n - l)ln^- ^ - ^ +n(n- l)ln 




(8.192) 


»=i j=i 


We now can take the thermodynamic limit. It is useful to note that 
linV- oo £]” =1 (l/n)cos(27ri/n) = J o 27r d0cos(0). Then for the free 

energy density (free energy per lattice site), a(T), we find 


<T) 

k B T 


= lim„. 


+ 


1 1 

247T 2 


>00 o 

\n l 

2n 


HUT)} 


= In 2 + In 


1 + x 


+ In 


i+y 


d(}> i 


2n 


d(j> 2 In 


11 12 2 
1 + ^2 + ^2+^2--COS(</>i) -~COs(</> 2 ) 


2 2 
+ ^ c °s(«1 2 ) + ^j co s(«1,) 


(8.193) 
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We can simplify subsequent calculations if we let J x = J y . Then 
x = y = tanh(A') and the free energy density takes the form 


1 1 ( 2n f 2 

+ 24?J 0 Ho 


d<p 2 In 


i 2 1 

1+ i? + s 


'2 2 

~ (---t)(cos(0i) + cos(<£ 2 )) 


X X- 


-ln(2)-21n(l+*)+-^-j 

*2ff 


2ff 


d(f>\ 


( Iff 

d(f >2 In [l + 2.x 2 + x 4 + 2x( 1 — * 2 )(cos((/>i) + cos(</> 2 ))]. 
o 

(8.194) 

We now can make use of the identities 

2x(l - x 2 ) = sinh(2/s:)/cosh 4 (/<:) and l+2x 2 +x 4 = cosh 2 (2/sT) /cosh 4 (tf). 

(8.195) 

Then the free energy density becomes 
a(T) 


k B T 


= In (2) - 2 K 


+ 


i_Lf 
2 47t 2 J 


2ff 


r2ff 


d(j)\ 


d(f >2 ln[cosh 2 (2/Q + sinh(2^T)(cos(</)i) + cos(</> 2 ))]- 


Equation (8.196) can be simplified still further. Note that cos(</>i) + cos(<H = 
2cos[|(0i + 0 2 )]cos[|(^i — <f> 2 )]. If we make the change of variables, 
6\ = ^ (0i — (f> 2 ) and 0 2 = | (0i + 0 2 ), and let k = 2sinh(2AT)/cosh 2 (2AT), 
then the free energy density becomes 


a(T) 

k B T 


= ln(2) — 2AT -h In [cosh (2K)] 


i r 

+ 47T 2 J . 


7r r 2n 

d6\\ d $2 ln[l + «cos(#i) cos(0 2 )]. 

-n JO 


(8.197) 


We can integrate Eq. (8.197) if we note that [24] 

»2?T 


/* Lff 

dx\n(a + &cos(;c)) = 2n r ln[(a + Va 2 — b 2 )/2]. (8.198) 

Jo 
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k B T 


= ln(2) -2K + In [cosh (2 K)] 


2 

+ - 
TT 


dO\n ^1 + yj 1 - K 2 sin 2 (6)j . 

The internal energy density is easily obtained from Eq. (8.199). We find 


(8.199) 


u(T) = J M (t^P) = 27 “ 7coIh < 2 ' f )[ 1 + (2 tanh 2 (2AT) - 1 )|k(k)], 

( 8 . 200 ) 

where K(k) is the complete elliptic integral of the first kind [25] and is defined 
as 


1 tt/2 jq 

— — • ( 8 - 201 ) 
0 yl — K 2 sin 2 (0) 

The specific heat is given by 

c = If = “ = -*Bir 2 coth 2 (2A:){2K( K ) - 2E(k) 

oT T oK 7r [ 

- 2 sech 2 (2/sT) + (2 tanh(2/sT) - l)K(/c)J 

The complete elliptic integral of the first kind, K(«), has a singularity for k = 1. 
The phase transition occurs at this point. The temperature at which the phase 
transition occurs is then given by the condition 


2sinh(2/f c ) 
cosh 2 (2AT c ) “ 


(8.203) 


This gives a critical temperature, T c = 2.269 J/k B . In Fig 8.12 we plot the 
mtemal energy per site and the specific heat as a function of K , which is 
proportional to the inverse temperature. 

Ising-like phase transitions have been measured in the two-dimensional 
Ising-like antiferromagnets, K 2 CoF 4 and Rb 2 CoF 4 . These substances behave 
like two-dimensional spin systems because they consist of strongly coupled 
antiferromagnetic CoF 2 planes separated by weakly coupled planes containing 
the remaining molecules. In Fig 8.13 we show two measurements of the heat 
Opacity of Rb 2 CoF 4 . We see the characteristic Ising-like singularity in the heat 
Opacity. 
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Fig. 8.12. (a) The internal energy per 
site as a function of K. (b) The specific 
heat as a function of K. The phase 
transition occurs at K c = 1 /2.269 = 
0.4407. 



105 rp HO 


Fig. 8.13. The heat capacity of Rb 2 CoF 4 
(given in arbitrary units) as a function of 
temperature, T, measured in Kelvin. Two 
sets of experimental data are shown. 
Reprinted, by permission, from H. Ikeda, 
I. Hatta, and M. Tanaka, J. Phys. Soc. 
(Japan) 40, 334 (1976). 
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In the section we have obtained analytic expressions for the free energy, the 
internal energy, and the heat capacity of a two-dimensional square planar Ising 
lattice. From Eq. (8.202) we can obtain the critical exponent, a. The type of 
singularity that occurs in the heat capacity is easy to find from properties of the 
comple te ellip tic integrals. Let us note that as k, — * 1, E(k) — ► 1 but K(k) — > 
ln(4/\/l — k?). Thus, the singularity in the heat capacity is logarithmic and 
the critical exponent, a, for the two-dimensional Ising lattice is a = 0. 

It is interesting to compare Fig. 8.12, the exact heat capacity of the two- 
dimensional Ising lattice, to that of the mean field approximation given in Fig. 
7.8. Mean field theory predicts a finite discontinuity, while the exact solution 
gives a logarithmic singularity. The two theories also give different critical 
temperatures. 
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PROBLEMS 


Problem 8.1. A one-dimensional lattice of spin-j lattice sites can be decomposed into 
blocks of three spins each. Use renormalization theory to determine whether or not a 
phase transition can occur on this lattice. If a phase transition does occur, what are its 
critical exponents? Retain terms in the block Hamiltonian to order (V), where Vis the 
coupling between blocks. 

Problem 8.2. Find the critical exponents for five spin blocks on a square lattice for the 
two-dimensional nearest-neighbor Ising model. Retain terms to lowest order in (V), 
where V is the interaction energy between blocks (cf. Fig. 8.14). 

• * * T * * 



i = VE 


Fig. 8.14. 


Problem S8.1. Consider the directed lattice shown in the figure. Assume that horizontal 
bonds have a coupling constant, J x , and that tilted vertical bonds have coupling constant, 
J y . (a) Find the antisymmetric matrix, A, associated with this lattice. Compute its 
determinant, (b) Draw all disconnected dimer graphs for this lattice. Show that they 
represent \/A. 
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Problem S8.2. Consider the directed lattice shown in the Figure. Assume that horizontal 
bonds have coupling constant, J x , and that vertical bonds have a coupling constant, J y 
(a) Find the antisymmetric matrix, A, associated with this lattice. Compute its 
determinant, (b) Draw all disconnected dimer graphs for this lattice. Show that they 
represent \/A. 



Problem S8.3. Consider the planar triangular lattice shown in the figure. Assign a 
coupling constant, J, to each bond, (a) Compute the partition function and draw all 
closed graphs for this lattice, (b) Draw the directed mixed graph for this lattice and 
obtain the antisymmetric matrix it generates. Show that the determinant of the 
antisymmetric yields the correct expression for the partition function. 
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9.A. INTRODUCTION 

One of the great successes of equilibrium statistical mechanics has been the 
theory of interacting fluids. For real fluids, the interparticle potential has a hard 
core and a weak attractive region. Consequently, the usual perturbation theories 
do not work. For dilute or moderately dense fluids, the only small parameter is 
the density and therefore, various thermodynamic quantities can be formulated 
in terms of a density expansion. 

In this chapter we shall restrict overselves primarily to neutral fluids with 
spherically symmetric potentials. The quantity of central importance in 
describing such systems is the radial distribution function, which is a measure 
of the probability of finding a particle at some distance, q, from any particle in 
the fluid. The radial distribution function can be measured directly by X-ray 
scattering experiments or it can be obtained from molecular dynamics 
experiments. We shall begin this chapter by deriving an expression for the 
internal energy in terms of the radial distribution function, and in the special 
topics section we will derive expressions for both the pressure and 
compressibility in terms of the radial distribution function, using both the 
canonical and grand canonical ensembles. 

In the chapters on thermodynamics, we have already been introduced to the 
virial expansion of the equation of state. We shall now derive an expression for 
the virial coefficients using microscopic theory. The method involves writing 
the grand partition function in terms of a cumulant expansion and then 
expressing the grand potential and pressure in terms of the cumulants or cluster 
functions. The techniques for doing this were first developed by Ursell, whose 
work was later extended by Mayer. The analysis is greatly facilitated by the use 
of graph theory which enables one to give a simple physical interpretation to 
various terms in the virial expansion. The first few virial coefficients have been 
computed for a variety of interparticle potentials. We shall discuss the results 
for the hard-core potential, the square-well potential, and the Lennard-Jones 
6-12 potential and compare them with experiment. 

For very light molecules at low temperatures, quantum effects introduce 
significant deviation from classical behavior. In the special topics section we 
will construct the virial expansion for interacting degenerate quantum fluids. 
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9.B. THERMODYNAMICS AND THE RADIAL 
DISTRIBUTION FUNCTION [1-5] 

In this chapter we will compute the equilibrium properties of a classical fluid 
consisting of N particles with mass m whose dynamics is determined by a 
Hamiltonian of the form 


N 2 N(N- 1)/2 

n N ( pW) = ££-+ E v K). 


i=l 


(y)=i 


(9.1) 


where q,y = q, — q j is the relative displacement of particles, i and j (we use the 
notation of Section 6.B). It is useful to introduce the one-body phase function, 
0^(q N ) = Y^!i= i 0\ (q,) (the kinetic energy is an example of a one-body phase 
function), and the two-body phase function, 0% (q N ) = 02 (q/ — q ; ) 

(the potential energy is an example of a two-body phase function). The average 
values of these phase functions can be written 


(0?) = = J«fq,Oi(q 1 K(q 1 ; V,T) (9.2) 

and 

(°2)=\ d <h^ d q202(qnq 2 )n%(q\,q2-,V,T), (9.3) 

where n^q^V, T) and n% (qj, q 2 ; V, T) are the one-body and two-body 
reduced distribution functions, respectively, in the canonical ensemble. They 
are defined 


(Qi>- ••»<!/, V,r) Q N (y, T) (N - /)! ‘ ' ' dqN e P ’ ( 9 ‘ 4 ) 

where V N = V(q,y) is the potential energy. In Eq. (9.4) the quantity 

Qn(V,T) is called the configuration integral and is defined as 


Qn{VJ) = 


dq N e 


N-f3\ N 


(9.5) 


The reduced distribution functions have the property that 


* AH 

dq r -- dqrf (qj , . . . , q C,V,T)= — j . (9.6) 

One of the simplest quantities to compute is the internal energy, U(V, T,N). 
The internal energy is just the average value of the Hamiltonian and can be 



490 


INTERACTING FLUIDS 


written 


J dq N V 




U(V,T,N) = (H) = -Nk B T+ f dqNe ^ 


= ^Nk B T + ] - |dq,dq 2 V(q, 2 )n5'(q 1 ,q 2 ;V,r). 


(9.7) 


The term \Nk B T is the kinetic contribution to the internal energy, and 
the remaining term is the contribution due to the interaction between the 
particles. 

The expression for the internal energy simplifies considerably for systems 
whose particles interact via spherically symmetric potentials, V(q,y) = V(^), 
where = |q ( - — q,| is the magnitude of the relative displacement. Then the 
two-particle reduced probability density takes the form n^(q 1? q 2 ; V, T) = 
n%( qi 2 ; V, T). Furthermore, it will depend only on the magnitude, qij, of the 
relative displacement. If we change the integration variables to relative, q^, and 
center-of-mass, Q,y = \ (q, + q ; ), coordinates and let n% (q 12 ; V, T) — 
{^) 2 g%{qiy,V,T), then we can integrate over the center-of-mass coordinates 
and find 


3 IN 2 

U{V,T,N) = -Nk B T + 


4irq 2 dqV(q)g%{q-,V,T), (9.8) 


where q = qu- The function (q; V, T) is called the radial distribution 
function. The radial distribution function is extremely important because it 
completely characterizes the behavior of a classical fluid of spherically 
symmetric particles. It also has a direct physical interpretation. The quantity 
(N /V)g{q)A , Kq 2 dq is the average number of particles in a spherical shell ot 
width q — > q + dq at a distance q from any particle in the fluid. 

The form of the radial distribution function can be determined from neutron 
scattering experiments in which slow neutrons scatter from atomic nuclei in a 
liquid. The angular dependence of the scattered neutrons is measured, and 
this information can be used to construct the static structure factor, Snn{k) 
(cf. Exercise 9.1), of the liquid. The structure function can be expressed in 
terms of the radial distribution function. In Fig. 9. 1 we show the structure factor 
and the radial distribution function obtained from neutron scattering 
experiments on liquid 36 Ar at 85 K. The radial distribution function goes to 
zero at about the hard-core radius, indicating that no particles can penetrate the 
hard core. It has a maximum at about the distance of the minimum of the 
attractive potential between nearest-neighbor argon atoms in the liquid. The 
next peak is due to the high probability of finding next nearest neighbors at that 
position, and so on. 
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I . I . 1 1 I 1 1 1 — 

0 5 10 15 q 20 25 


Fig. 9.1. (a) The structure factor, S(k) = S„„(k), versus k (in A -1 ) for liquid 36 Ar at 85 
K obtained from neutron scattering experiments. The dots are data poins. The solid line 
is a best fit to the data, (b) The radial distribution function, g{q), versus q (in A) obtained 
from the data in (a). Reprinted, by permission, from J. L. Yamell, et. al., Phys. Rev. A7, 
2130 (1973). 


I EXERCISE 9.1. The static density correlation function, C„„(r), for an 
equilibrium fluid can be written C„„(r) = (1/N) fdr'(n(r' + r)n(r')), 
where the density phase function is defined n(r, q N ) = J2? =l S(q,- — r). 
The static structure factor for the fluid is defined as S„„(k) = J dre ikr C nn ( r). 
Show that in the limit N — > oo and V — > oo with n = N jV = constant, the 
static structure factor when written in terms of the pair correlation function 
has the form 


Snn(k) = 1 + «<S(k) + 


47 r 


qdqsin(kq) (g 2 (q) - 1). 


I 
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| Answer: The static structure factor may be written 
S«n(k) =~j | dre lkT J dr'{n(r' + r)n(r ')) 

= ^ f dr'dre ikT ^ £ (%,• “ r ' “ r ) 6 (<lj ~ 0) 

J i = 1 j = 1 

=iEE(^ ) )- a) 

'=1 7=1 

If we evaluate the average value in the canonical ensemble, we find 

i r i r 

= ' +^|dqi |dq2« , ’‘ <1 "i2 V (qi,q2, v,T) 

= 1 + ^J rf q<2 cik,,2 ^(«i2; v * r ) (2) 

The minimum value of the wavevector, k, is determined by the size of the 
box. In the limit N —* o o and V — * oo with n = N/V = constant, the length 
of the wavevector can be zero. We can separate this contribution from the 
remainder of the integral. We find 

S»(k) = 1 + n<5(k) +n d^^(f 2 (q n -,V,T) - 1). (3) 

The integration over angles in Eq. (3) can be performed to finally give 

S«n(k) = 1 + n<5(k) +~^qdqsin(kq){g^ (, q ; V, T) - 1). (4) 

The structure factor can be measured in neutron scattering experiments, and 
the term «<5(k) is the contribution due to coherent forward scattering. 


9.C. VIRIAL EXPANSION OF THE EQUATION OF 
STATE [1-3, 7, 8] 

The first step in obtaining tractable microscopic expressions for the thermo- 
dynamic properties of fluids is to find a small parameter that can be used as an 
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expansion parameter. For dilute or moderately dense gases, such a parameter is 
the density. In this section we shall first obtain a microscopic expression for the 
virial expansion of the equation of state (density expansion) for a classical fluid, 
and then we shall compare the predictions of the microscopic theory to 
experimental results. 


9.C.I. Virial Expansion and Cluster Functions 

Let us consider a classical gas of identical particles of mass m which interact via 
two-body short-ranged forces. We shall assume that the potential has a large 
repulsive core and short-ranged attraction (cf. Fig. 9.2). For such a system the 
grand partition function can be written in the form 

00 1 1 

Z tl (T,V) = '£--^e^Q N (V,n (9.9) 

N=0 iy ' A T 

where the momentum integrations have been performed and the configuration 
integral, Qn{V, T), is defined in Eq. (9.5). 

If the interparticle potential, Vy, is short-ranged, then for large values of 
separation between particles we obtain Vy = 0 and exp(— /3Vy) = 1. It is 
convenient to introduce a function, fy, such that 

fa - e-^i - 1. (9.10) 

The function fy becomes zero outside the range of the interaction. Furthermore, 
in the region of the hard core, where Vy = oo, the function fy = — 1. Thus fy is 
a much better expansion parameter than Vy (cf. Fig. 9.2). In terms of the 



Fig. 9.2. The dashed line is a plot of a typical interparticle potential, v(x) (Lennard- 
Jones 6-12), the solid line is a plot of f(x) — e ~ !Mx > — 1, and a is a measure of the hard- 
c °re radius (v(l) = 0). The plots are given for a fixed temperature. 
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function fj, the configuration integral can be written as 


Qm(V,T) 


H(n-m 

• • • dqyy n c i+ft )■ 

(y) 


(9.11) 


In Eq. (9.11) the product is taken over all pairs of particles ( ij ). There are 
N(N — l)/2 such pairs. 

It was shown by Ursell [9] that the partition function can be written in terms 
of a cumulant expansion. We first write the configuration integral in the form 

Qn(V,T)= [• •• Ul,- •<i<l A ,WV(q l ,...,q w ), (9.12) 


so that 

00 i i f f 

Z,(T, *0 = E m J • • J <*li ' ' ' ••,«*)• (9-13) 

We now define the following cumulant expansion for the grand partition 
function: 


z n( T , v ) = ex P 


£ T\ }' "} rfqi ' i> • • • »%) 


(9.14) 


where t//( q l5 . . . , q z ) is called a cluster function of Ursell function. In terms of 
the cluster functions, the grand potential takes a simple form: 


f2(v,r,/i) = -forinfz^r, v)] 


°° i i r r 

= ~ kl)T Jl 71 K5? e,V j ” • J d< »! • ■ ..,%)• 


(9.15) 


If we know the function W^(q . .^q^), then we can easily find the cluster 
functions Un{ q l5 . . . , q#). We simply expand Eqs. (9.13) and (9.14) in powers 
of (Af 3 exp(/3//')) and equate coefficients. We then obtain the following 
hierarchy of equations: 

i/i(qi) = W,( qi ), (9.16) 

£Mqi,q 2 ) = w 2 (q 1} q 2 ) - ^i(qi)Wi(q 2 ), (9.17) 

^(qi,q 2 ,q 3 ) = W 3 (qi,q 2 ,q 3 ) - Wi(q,)W 2 (q 2 ,q 3 ) 

- Wi(q 2 )W 2 (qi,q 3 ) - Wi(q 3 )W 2 (q 1 ,q 2 ) 

+ 2Wi(q ] )W,(q 2 )W l (q 3 ), 


(9.18) 
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and so on. We can revert Eqs. (9.16)-(9.18) and find 

Wi(qi) = £/i(qi), (9-19) 

W 2 (qi,q 2 ) = ^(qi,q 2 ) + £/i(qi)C/i (q 2 ), (9-20) 

W 3 (qi,q2iQ3) = t / 3(qi,q 2 ,q 3 ) + £A(qi)f/ 2 (q 2 ,q 3 ) + ^i(q 2 )^ 2 (qi,q 3 ) 

+ C/ 1 (q 3 )C/ 2 (q 1 ,q 2 ) + f/i(qi)f/i(q 2 )f/i(q 3 ), (9.21) 


and so on. The cumulant expansion we have just introduced is independent of 
the form of WW(qi , • • • , %)• 

For the systems we consider in this chapter, we can write 

N(N- 1)/2 

WV(q„...,q*)= I] ( 9 - 22 ) 

(0 


Thus, the first few terms are 


H',(q,) = l > (9.23) 

W , 2 (q„q 2 ) = (l+/i 2 ), (9-24) 

W 3 (qi,q 2 ,q 3 ) = (1 +/ 12 XI +/o)(l +/23)> (9-25) 

M , 4(qi,q 2 ,q 3 ,q4) = (1 +/i 2 )(l +/b)(1 +/u)(1 +/2 3 )(1 +/ 24 XI 4 -/ 34 ), 

(9.26) 

and so on. If we expand out the expressions for , . . . , q^), we see that the 
case N = l contains one term, the case N = 2 contains two terms, the case 
N = 3 contains eight terms, and the case iV = 4 contains 64 terms. From 
Eqs. (9.16)— (9.18) we can find the first few cluster functions. They are 


C/i(qi) = 1, 

^2(qi,q 2 ) =/i 2 , 


(9.27) 

(9.28) 


and 


^3(qi,q 2 ,q 3 ) —fnfn +/i 2 / 2 3 +/i 3 / 2 3 +/i 2 /i3/ 2 3- (9.29) 

The expression for t/^qj, q 2 , q 3 , q 4 ) is too long to write down here, so we will 
leave it as an exercise. It contains 38 terms. It is easy to see that the functions 
^v(flb • • • ,q^) and L/jv(qi> • ■ • ,q^) rapidly become very complicated as N 
increases. 

Since the grand partition function contains terms with arbitrarily large values 
°f N, it is necessary to find a systematic procedure for categorizing and sorting 
various terms in the expressions for Wjv(q, , • . . , q^) and £/^(qi, . . . , q^). Such 
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a procedure is called graph theory. We can represent W^qj , . . . , q^) in terms 
of graphs in the following way: 

WV(qi , . . . , q*) = ^ (all different //-particle graphs). (9.30) 

An /V-particle graph is obtained by drawing //-numbered circles and connecting 
them together with any number of lines (including zero) subject only to the 
condition that any two circles cannot be connected by more than one line. Two 
A-particle graphs are different if the numbered circles are connected differently. 
An algebraic expression can be associated with an //-particle graph by assigning 
a factor fy to each pair of numbered circles that are connected by a line. For 
example, the six-particle graph is expressed as 


= 734 / 25 / 26 , (9-31) 


and the five-particle graph is expressed as 

© © © 

= / 45 - (9.32) 

©— © 

The graphical expression for W 3 (q 1} q 2 ,q 3 ) can be written 

^ 3 (^ 1 , Q 3 ) — ^ ^ +3 ^ ^ -4- £ ^ 3-+- A , (9.33) 

where we have grouped together all graphs with the same topological structure 
before labels are attached. 

From Eqs. (9.16)-(9.18), it is not difficult to see that the cluster functions can 
also be represented by graphs, but of a special form. We find that 

Ui(q l , . . . , q/) = ^2 ( a N different /-clusters). (9.34) 

An /-cluster is an /-particle graph in which all numbered circles are 
interconnected. Thus, in an /-cluster it must be possible to reach all numbered 
circles by continuously following the lines connecting the circles. Graphical 
expressions for the first few cluster functions are given by 

£/i(qi)=«, (9.35) 

£Mqi,q 2 )=*-* (9.36) 

t/ 3 (qi,q2,q3) = 3^ # + ^, (9.37) 
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^4(qi,q 2 ,q 3) q4) =12 + 12 Pt 

+ 3 n + 6 (9.38) 


Again, we have grouped together all graphs which have the same topological 
structure before lables are attached. 

We see from graph theory that the cluster functions have a special meaning. 
An Aspartic le cluster function U N { q t , . . . , q#) is nonzero only if all N-particles 
interact. Thus, they contain information about the way in which clusters occur 
among interacting particles in the gas. By writing the partition function in terms 
of a cumulant expansion, we have isolated the quantities which have physical 
content. 

There is also another reason for writing Z^(T, V) as a cumulant expansion, 
which does not become apparent until we try to integrate over the functions 
H^v(qi, . . • , q#). Let us consider some examples and restrict ourselves to four- 
particle graphs. 


(a) Each unconnected dot gives rise to a factor V. For example, 


.-f dq ' 


dq 4 


• • 


and 


M 


dq r -dq 4 


•J)-H 

"'III 


rfq, • ■ - dq 4 1 — V 4 

dq x • • • ^4/24/34 
dq 2 dq 3 dq 4 f 14 f 34 . 


(b) If two parts of a graph are not connected by a line, the corresponding 
algebraic expression factors into a product. For example, 



1 • ■ • dq 4 



dq { • • -^4/12/34 



^1^2/12^ ^ dq 3 dq 4 f 3 ^j 


(c) Each completely connected part of a graph (cluster) is proportional to 
the volume (this comes from integrating over the center of mass of a 
cluster). 


For example, 


dq, J) = (vjdqn/12) (1^34/34) 
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and 

| ■ J dqi ■ • ■ dq 4 (HHJI dq ] dq 2 dq 3 fnfi?, /13 

== ^ 2 |l^ 31 ^^ 21 ■^(l^ 31 l)^(l^ 21 l)^(l^ 31 — ^21 1) 7 

where q 31 = q 3 ~ Qi and q 21 = q 2 - q v In the second example, we have 
integrated over the position of particle 1 in the cluster. This gives a factor V 
since the cluster can be anywhere in the box. The positions of particles 2 and 3 
are then specified relative to particle 1 (it is useful to draw a picture). Thus, the 
configuration integral Qn(V, T) contains terms which have volume dependence 
of the form V a , where a can range anywhere from a = 1 to a = N. The 
integrals over cluster functions, however, are always proportional to the volume. 
We can write 


W) = - 


dq, • • • dq [ Ui(q l , . . . , q/), 


(9.39) 


where bi(V, T) is called a cluster integral and, at most, depends on the density. 
Thus, the expression for the grand potential, Q(U, T, //), in Eq. (9.15) is 
proportional to the volume. 

We are now in a position to obtain the virial expansion of the equation of 
state. From Eqs. (9.15) and (9.39) we may write the grand potential as 


V(V,T,n') = -Vk B Tj2 


b[(T , V)e^ 


i=\ 




The pressure then becomes 


P 


V 


00 


k B T^2 

l=\ 


bi{T , V)eM 


and the particle density is given by 

(N) _ 1 /9f2\ _ Y~ lb,(V 

V - VWJyjTfe \j! 


(9.40) 


(9.41) 


(9.42) 


The virial expansion of the equation of state is an expansion in powers of the 
density, 


PV 



1 - 1 


(N)k B T 


(9.43) 
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To determine the Ith virial coefficient, B u let us for simplicity consider all 
equations in the thermodynamic limit (V — ► oo, (N) — » oc, such that 

(N) /V =constant). Then bi(T,V ) >limitay_ +00 ®/(r). If we combine Eqs. 

( 9 . 41 ), (9.42), and (9.43), we obtain 



bi(T)e^'\ 

A? ) 


£ 

, n=l 


nd n (T)e^' S 


-1 


\3 n 


= E b ME 




V - 1 




tn'=l 


\3n' 
/\ y 


(9.44) 


If we now expand both sides of Eq. (9.44) and equate coefficients of equal 
powers of Af 3 exp(/fy/), we obtain the following expressions for the first four 
virial coefficients: 


s,(r) = »i(r) = l, (9.45) 

B 2 (T) = -»2 (T), (9.46) 

B 3 (r)=4^(r)-2S 3 (7-), (9.47) 

Bi(T) = -20 b\(T) + 18B 2 (7’)^(7') - 354(7"), (9.48) 


and so on. The higher-order terms in the virial expansion are determined by 
larger and larger clusters of particles. 

The functions fj depend only on relative coordinates. Consequently, we can 
show that 


y\\\ dq x dq 2 dq 3 


A) = lb- 

Similarly, 

- • b • • MLI) • • b (14 


dq i 


and 


I r 

V 


j * j^qi = 




dq 2 dq 3 


These identities enable us to simplify the expressions for the virial coefficients 
and write them as 


Bn(V,T) 


_I — ! — f 

n(n — 2)! J 


dqi •••rfq„V B (qi,...,q n ) J 


(9.49) 
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where V„(q,, . . . , q n ) is called a star function and 

V n (q lf . . . , qj = (all different n-particle star graphs). (9.50) 


An n-particle star graph is a connected graph which is so tightly connected that 
if we remove any point and all lines that connect to it the remaining graph will 
still be connected. 

In terms of graphs, the first few star functions are given by 


^(qi,q 2 ) 

(9.51) 


V3 (<li , q 2 , <13) 

(9.52) 

and 



V 4 (qi,q 2 , 

93^4) = 3 ■ 

(9.53) 


The numerical factors in front of each graph in Eq. (9.53) indicate that there are 
that many ways to assign the coordinates to the points and obtain topologically 
different graphs. Thus, there are that many different graphs. The nth-order virial 
coefficient is composed only of tightly bound connected n-particle graphs. 

We are now in a position to corppare the predictions of this theory to 
experimental observations. 


9.C.2. The Second Virial Coefficient [1] 

The second virial coefficient gives the correction to the ideal gas equation of 
state due to two-body clustering. For very dilute gases, two-body clusters give 
by far the dominant contribution from interactions in the fluid and it is sufficient 
to terminate the virial expansion at second order. 

From Eqs. (9.10), (9.28), (9.39), and (9.46), the second virial coefficient can 
be written 


Bi (T) = -^jjdq,dq 2 /(q 12 ) = -\ - 1), (9.54) 

where we have changed to center of mass and relative coordinates and have 
integrated over the center of mass. The behavior of the second virial coefficient 
has been studied for a variety of interparticle potentials. For very simple 
potentials it can be computed analytically, and for realistic potentials it must 
be computed numerically. We shall focus on two potentials which historically 
have been important in understanding the behavior of the virial coefficients. 
They are the square- well potential and the Lennard-Jones 6-12 potential 
(cf. Fig. 9.3). 



VIRIAL EXPANSION OF THE EQUATION OF STATE 


501 



Fig. 9.3. Sketch of (a) the square-well potential and (b) the Lennard-Jones 6-12 
potential. 


9.C.2.I. Square-Well Potential. 

The square-well potential is shown in Fig. (9.3. a) and has the form 

{ oo 0 < q < (7, 

-e <J < q < Rcr , (9.55) 

0 Rcr < q. 

The square-well potential has a hard core of radius, <r, and a square attractive 
region of depth e and width (R — l)cr. The second virial coefficient can be 
computed analytically and has the form 

97HT 2 

Bi(T) sv = [1 -(R 3 - 1)(<A - 1)]. (9.56) 

Note that # 2 (T) SW differs from # 2 (T) HC by a temperature-dependent term (see 
Ex. 9.2). At low temperatures it is negative and at high temperatures it becomes 
positive. At low temperatures the attractive interaction energy due to the square 
well can compete with the thermal energy, k B T, and causes a lowering of the 
pressure relative to the ideal gas value. At high temperature the hard core 
becomes dominant and the pressure increases relative to that of an ideal gas. 

We can write #2(7%w i n a reduced form if we let 5*(T) SW = 
5 ( r )sw/^o(^o = 27ra 3 /3) and T* = k B T/e. Then we find 

BUT ) sw = [1 - (* 3 - l)(e 1/r - 1)]. (9.57) 

Equation (9.57) will be useful when we compare the square-well results to 
e xperiment. 
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9.C.2.2. Lennard- Jones 6-12 Potential 

A potential which gives a very good approximation to the interaction between 
atoms is the Lennard-Jones 6-12 potential, 


V L j(tf) = 4e 



(9.58) 


(cf. Fig. 9.3b). The Lennard-Jones potential has a gradually sloping hard core, 
which takes account of the fact that particles with high energy can, to some 
extent, penetrate the hard core. When q — a we obtain V L j(cr) = 0. Thus, q — a 
is the radius of the hard core when the potential changes from positive to 
negative. The minimum of the Lennard-Jones potential occurs when 
q = (2) 1//6 <t and the value of the potential at the minimum is 
Vlj((2) 1//6 <t) = —e. Thus, e is the depth of the Lennard-Jones potential. 

The second virial coefficient for the Lennard-Jones potential can be found 
analytically in the form of a series expansion. If we integrate Eq. (9.54) by parts 
and introduce the notation x = q/cr, T* = k B T/£, and B\{T) U = B 2 (T) u /b 0 , 
we find 


B\(T) u =- 


dxx 2 


12 

v 12 




(9.59) 


If we expand exp[(4/r*)(l/x) 6 ] in an infinite series, each term of the series can 
be computed analytically and we obtain the following expansion for B 2 (T) U : 


Bi(T)u = 


00 / i \ (( 2 ” +1 )/ 4 ) 

J2 an (yd 

71=0 V / 


(9.60) 


where the coefficients a are defined in terms of gamma functions as 


Oi n 



(9.61) 


Values of the coefficients a n for n = 0, . . . , 40 are given in Ref. 1, p. 1 1 19. In 
Table 9.1 we give all values for n = 0, ...,12. The expansion for B 2 (T) LJ 
converges rapidly for T* > 4, but more slowly for lower values of T*. Values of 
B* 2 {T) u for T* ranging from 0.3 to 400 are given in Table 9.2. 

In Fig. 9.4 we plot B\{T) versus T* for both the square-well potential and the 
Lennard-Jones potential. We also give experimental values of B 2 (T) for a 
variety of substances. The Lennard-Jones potential gives values of B 2 (T) in 
good agreement with experimental results. At high temperatures, B 2 {T) U and 
the experimental points for He gas exhibit a maximum, while 5 2 *(7’) sw does 
not. The maximum occurs because at high temperatures, particles can penetrate 
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Table 9.1. Coefficients for the Expansion of the 
Second Virial Coefficient for Lennard- Jones 6-12 
Potential 0 


n 

Oi n 

n 

cx n 

0 

+ 1.7330010 

7 

- 0.0228901 

1 

- 2.5636934 

8 

- 0.0099286 

2 

- 0.8665005 

9 

- 0.0041329 

3 

- 0.4272822 

10 

- 0.0016547 

4 

- 0.2166251 

11 

- 0.0006387 

5 

6 

- 0.1068205 

- 0.0505458 

12 

- 0.0002381 


“Based on Ref. 1. 


Table 9.2 Values of the Reduced Second Virial 
Coefficient versus the Reduced Temperature for 
the Lennard-Jones Potential 0 


r 

*2 

r 


0.30 

- 27.8806 

4.00 

+ 0.1154 

0.40 

- 13.7988 

4.50 

+ 0.1876 

0.50 

- 8.7202 

5.00 

+ 0.2433 

0.70 

- 4.7100 

10.00 

+ 0.4609 

1.00 

- 2.5381 

20.00 

+ 0.5254 

1.50 

- 1.2009 

30.00 

+ 0.5269 

2.00 

- 0.6276 

40.00 

+ 0.5186 

2.50 

- 0.3126 

50.00 

+ 0.5084 

3.00 

- 0.1152 

100.00 

+ 0.4641 

3.50 

+ 0.0190 

400.00 

+ 0.3583 


“Based on Ref. 1. 


into the hard core and lower the amount of excluded volume. The square- well 
potential has a hard core with infinite slope and cannot account for this effect, 
while the Lennard-Jones potential, with a sloping hard core, can account for it. 
The data points for He deviate from the classical results at low temperature. 
These deviations are due to quantum effects and will be discussed in 
Sec. (S9.D). The second virial coefficients for all classical gases, when plotted 
in terms of reduced quantities, are identical. This is an example of the law of 
corresponding states. 

In Table 9.3 we have given a list of values for the parameters s/k B and a for 
various substances, assuming they interact via square-well or Lennard-Jones 
potentials. These parameters are obtained from experimental values for the 
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Fig. 9.4. The reduced second virial coefficient. The solid line is the calculated curve for 
the Lennard-Jones 6-12 potential. The dashed line is the calculated curve for the square- 
well potential (for R = 1.6). The points are experimental values for the gases listed. 
(Based on Ref. 1.) 


second virial coefficient. Thus, measurements of the virial coefficients of real 
gases provide an extremely important way of determining the form of the 
effective interparticle potential for various molecules. The values of e can be 
found if we take experimental values of BjiJ) for two different temperatures, 


Table 9.3 Values of the Parameters a and e for the Square- Well (S-W) and 
Lennard Jones (L-J) 6-12 Potentials Taken from Experimental Data on the Second 
Virial Coefficient" 



Potential 

R 

<x(A) 

e/k B (K) 

Argon (A) 

S-W 

1.70 

3.067 

93.3 

190 °C 600 °C 

L-J 


3.504 

117.7 

Krypton (Kr) 

S-W 

1.68 

3.278 

136.5 

160 °C — ► 600 °C 

L-J 


3.827 

164.0 

Nitrogen (N 2 ) 

S-W 

1.58 

3.277 

95.2 

150 °C — > 400 °C 

L-J 


3.745 

95.2 

Carbon dioxide (C0 2 ) 

S-W 

1.44 

3.571 

283.6 

0°C ^600°C 

L-J 


4.328 

198.2 

Xenon (Xe) 

S-W 

1.64 

3.593 

198.5 

0°C -^700°C 

L-J 


4.099 


Methane (CH 4 ) 

S-W 

1.60 

3.355 

142.5 

0°C 350 °C 

L-J 


3.783 

148.9 


'Based on Ref. 10. 
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sa y T\ and T 2 , equate the two ratios 

(1(F)) = Sfe’rl!** )' (9 ' 62) 

\Bl {' l)J exper \~*2 ^ 1 itheor/ 

and solve for e. In this ratio, all dependence on a is eliminated. Once we find e, 
we can find a if we note that 

Bi(T) apa = — (9.63) 

The only unknown in Eq. (9.63) is a. 

The Lennard- Jones 6-12 potential is perhaps the most widely used 
interparticle potential, but there are many other forms of potential that may 
be used to compute the virial coefficients, and some of them give better 
agreement with experimental results over a wider range of temperature than 
does the Lennard- Jones potential. 

Computation of the third virial coefficient is more challenging but still can be 
done for some simple potentials. However, new effects can enter, such as three 
body forces which are not included in the virial expansion given here. Some of 
these effects are described in Section S9.C. Also, in Section S9.C the third virial 
coefficients for hard spheres and for particles coupled by a square-well potential 
and by the Lennard-Jones potential are computed. 


i ■ EXERCISE 9.2. Compute the second virial coefficient, B 2 (T), for a hard 
i sphere gas of atoms of radius, R, confined to a box of volume, V. Write the 
| equation of state for this gas as a virial expansion to first order in the density. 


I Answer: The second virial coefficient, B 2 {T), is defined as 

j 

Bi(T) = ~“Jd q i Jd q2 / (q 2i), (1) 

where q 21 = q 2 — qi and/(q 21 ) = (e~ /3v ( q 2 i) — 1). The interaction potential, 
V(q 2 i), for hard spheres is given by V(q 21 ) = oo for q 2 \ <R and 
V(q 2 i) = 0 for q 2 \ > R. We can make a change of variables and let 
9 = 92 ~ 9i and Q = \ (q t + q 2 ). Then 


MT) = -~jdQ Jrfq/(q) = - \ U(<r /5V " ,) - 1) 

. f* 2 27 TR 3 
= +27T dqq = — — = b 0 . 

Jo 6 

The equation of state, to first order in the density, is 

PV . . N 

__=i 
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9.C.3. Higher-Order Viral Coefficients [2] 

As we increase the density or decrease the temperature of a gas, at some point it 
will undergo a transition to a liquid or a solid phase. Past this point, and well 
before we reach it, the virial expansion ceases to give good agreement with the 
observed equation of state. It is possible to obtain some idea of the limits of the 
virial expansion for a hard-sphere gas. Although hard-sphere gases do not exist 
in nature, they can be constructed on a computer by means of molecular 
dynamics calculations. In these calculations, the motion of a finite number of 
particles is followed by solving the equations of motion of the system. The 
phase space coordinates of all the particles can be obtained at any given time, 
and the phase functions of interest can be computed. The thermodynamic 
properties are obtained by time averaging of the phase functions of interest. 

The second through sixth virial coefficients for systems of hard spheres have 
been computed [11] and the results are Bi(T) = b 0 , B-x,{T)=^b^ 
B 4 {T) = 0.29b 3 0 , B 5 (T) = 0.11^, and B 6 (T) = 0.04^, where b 0 = 2'no 3 /3 
and a is the hard-core radius. The virial expansion for the equation of state is 
plotted in Fig. 9.5 and compared to “experimental” results from molecular 
dynamics calculations. In the figure, a curve is given for the virial expansion up 


OQ 

fe; 

cl, 




V 0 /V 


Vo/V 


Fig. 9.5. Equation of state for hard-sphere and hard-disk fluids, (a) Hard-sphere fluid: 
Line 1 includes all virial coefficients up to B s ; line 3 gives the prediction of the Pade 
approximate; molecular dynamics results are indicated by the dots, (b) Hard-disk fluid: 
All quantities have meaning similar to part (a). (Based on Ref. 11.) 
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to and including B 5 (T) and the virial expansion up to and including B 6 (T). The 
agreement gets better as we add more terms, but it is not extremely good at 
higher densities. Better agreement occurs if we construct the so-called Pade 
approximate to the virial expansion. The Pade approximate is obtained as 
follows. If we are given a finite expansion for some function F(x), 


F(x) =a 0 + aix-\ h 


(9.64) 


we can construct the Pade approximate to F(x) by writing it in the form 


F « = 


b 0 + b\X 4- bjx?' + • ■ • + b m x m 
cq -)- c\x H h cix 1 


(9.65) 


If we expand Eq. (9.65) in a power series, we can find the coefficients 6 , and cj 
from the coefficients a n . If we know a finite number of coefficients, a n , then we 
can find a finite number of coefficients bt and cj. By doing this, we replace a 
finite series expansion by an infinite series expansion. 

Ree and Hoover [11] have constructed the Pade approximate to the equation 
of state for a hard-sphere gas. They obtain 


PV 

Nk^T 


\+bo 


N (l + 0.063507^o 7 + 0.0 1 7329^ (v) 2 ) 
V (l - 0.561493h 0 f + 0.08 13 13b 2 0 (£) 2 ) 


(9.66) 


The Pade approximate to the equation of state is plotted in Fig. 9.5. It gives 
much better results than the simple virial expansion. It is worth noting that 
the hard-sphere gas exhibits a phase transition from a fluid state to a 
crystalline state as the density increases. While we will not give the details 
here, Fig. 9.5 shows the results of an analogous calculation for a system of 
hard disks. The drop in the pressure at about V 0 /V = 0.75 indicates a transition 
to a crystalline state. The Pade approximate gives poor agreement for the crystal 
phase. 

Expressions for higher virial coefficients for square-well potentials have 
been obtained, but, as we can see from the results for hard spheres, a simple 
virial expansion does not give very good agreement for dense systems. 


► SPECIAL TOPICS 

► S9.A. The Pressure and Compressibility Equations 

ft is possible to express other thermodynamic quantities in terms of the radial 
distribution function. Below we derive expressions for the pressure and the 
compressibility in terms of the radial distribution function. 
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► S9.A.1 . The Pressure Equation 

We can obtain an expression for the pressure by using a simple trick. Consider a 
box of volume V = L? {L is the length of a side) which contains N particles. We 
will assume the box is large enough that we can neglect the effects of the walls 
(in the thermodynamic limit this is rigorously true). From Eqs. (2.98) and 
(7.44), we can write 


p T)]) r - *ar(Aln[e„(V, m) 


~(1 L HQ N (V, T )] 


The configuration integral can be written 


(9.67) 


Qn{V, T) = \ L dqi ■ ■ ■ f L dqwe-t"^ = V" f • • • [ ' <fe, • ■ ■ dx 3N e^ v ^\ 

JO JO JO JO 

(9.68) 

where we have made the change of variables, q, = Zjc,. The derivative of 
Qn{V,T ) with respect to L gives 


9Qn(V,T)\ 3 N 


Qn{V,T) 


0yN I* " ■ I dx> ' dx ™ (~ dir) TN e - mi *" ) ■ 


If we now note 


dV(Lx^) = l^dq l j av(q, ? ) = l „ 
at %at ■ aq„ 2 L^r d% 

we can combine Eqs. (9.67), (9.69), and (9.70) to give 


3V(q y) 


(9.69) 


(9.70) 


^J dq > J d<h q > 2 ■ ^r"^ (qi ’ q2; v ’ T) - 


av(q 12 ) V, 


(9.71) 


For spherically symmetric potentials, Eq. (9.71) takes the form 


P N Ann 2 (3 f 00 3j dV(q) 

J„ qdq ^r 


< > >dq dr&( < i' v ’ T ')- < 9 - 72 ) 

Jo oc l 
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Thus we find that the pressure depends only on the radial distribution function. 
Eq. (9.72) is called the pressure equation. 

p- S9.A.2. The Compressibility Equation 

As a final example, let us obtain an expression for the isothermal 
compressibility, kt,n • For this, we need the variance of the particle number, 
as was shown in Eq. (7.1 17), and therefore we must now describe the system in 
the grand canonical ensemble. The average values of one- and two-body phase 
functions can be written 

<0,) = j<iq,0 1 (q 1 K(q,;V',r) (9.73) 

and 

( Oi ) = ^dq x ^dq 2 0 2 {q l ,q 2 )n%(q l ,q 2 -,V,T) {9.1 A) 

where n^{q x ; V, T) and n^{q x , q 2 ; V, T) are the one -body and two-body reduced 
distribution functions, respectively, in the grand canonical ensemble. They are 
defined 


«f(q q/; v,t)= 


OO 1 

Etvztt 


Z»{VJ)fa{N-i)\h™ 


jdp"jdq, +1 ...dq„.-«™. 

(9.75) 


The one-body and two-body reduced distribution functions, in the grand 
canonical ensemble, have the property that 


(A7) = Jdq 1 <(q 1 ; V,r) (9.76) 

and 

(N 2 ) - <iV) = j dq, | dq 2 n£(q, , q 2 ; V, T). (9.77) 


If we now combine Eqs. (7.117), (9.76), and (9.77), we can write 

_ i = jrfqt pq2K(q,.q2; v, r) - <( q, ; v, rKta; v, r)]. 

(9.78) 
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For a system whose particles interact via a spherically symmetric potential, it is 
useful again to let n%( q 12 ; V, T ) = V, T). Then Eq. (9.78) takes 

the form 

( dn\ T f 00 

— J = [^1 + (n) dqW[&(q-, V, T) - l]j , (9.79) 

where (n) = (N)/V. Equation (9.79) is called the compressibility equation. 

The compressibility equation can be written in terms of the structure 
function, h(q) = g^{q) — 1, and takes the form 

isr (i?) =1 + < n > ( 9 - 8 °) 

The structure function contains all possible information about correlations 
between particles a distance q apart. Near a phase transition, where 
{dn/dP) T — * oo, it must become very long-ranged so the integral will diverge. 

► S9.B. Ornstein-Zernicke Equation 

Omstein and Zemicke [12] introduced the idea of decomposing the structure 
function into a short-ranged part, C(q), and a long-ranged part. They wrote the 
structure function in the form 

h(qn) = C(q n ) + (n) j^q 3 ( 9 - 81 ) 

Equation (9.81) is called the Ornstein-Zernicke equation. The first term, C(qn) 
(called the direct correlation function ), contains the effects of short-ranged 
correlations, and the second contains the effect of long-ranged correlations and 
allows for interactions between the particles 1 and 2 which first propagate 
through the medium. If we introduce the Fourier transform 

h{ k) = j dq n e lkqi2 h(qn) (9.82) 

and a similar one for c(qn), we can write Eq. (9.81) in the form 

h( k) = C(k) + (n)C(k)/*(k), (9.83) 

where C(k) is the Fourier transform of the direct correlation function, C(qn)- 
In terms of the quantity h( k), the compressibility equation takes the form 

KS) r =1+< " >S(k=0) - 


(9.84) 
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Fig. 9.6. The structure factor, S(k) = S nn (k), versus k (in A -1 ) for liquid 36 Ar obtained 
from neutron scattering experiments (dots), and from molecular dynamics (solid line) 
using the Lennard-Jones potential with parameters fitted to Argon. Reprinted, by 
permission, from J. L. Yamell, et. al., Phys. Rev. A7, 2130 (1973). 


If we take the reciprocal of Eq. (9.84), we find 

" ' -<*> W«>- 

(9.85) 

Since ( dP/dn) T — > 0 near the critical point, C(q) must remain short-ranged. 

The direct correlation function has been obtained from molecular dynamics 
simulation for a classical fluid of 864 atoms which interact via a Lennard-Jones 
potential, V(r ) = 4 e[(a/r) n — (a /r) 6 ]. In Fig. 9.6 we first make a comparison 
between the structure factor, S nn {k), obtained from neutron scattering 
experiments on argon and that obtained from molecular dynamics using the 
Lennard-Jones potential. The agreement is extremely good. The direct 
correlation function obtained from the molecular dynamics using the 
Lennard-Jones potential is shown in Fig. 9.7. The direct correlation function 
is negative in the region of the hard core and then rises sharply to a positive 
value outside the hard core. 

The Omstein-Zemicke equation, in general, must be solved numerically. 
However, Percus and Yevick [14] introduced an approximation which gives 
quite good agreement with the experimentally measured radial distribution 
function; and for the case of a hard sphere gas can be solved analytically. Percus 
and Yevick chose to write the direct correlation function in the form 

C Py (q) = grr (‘ ?)0 - ^ V(,) ). (9.86) 

When this is substituted into the Omstein-Zemicke equation, one obtains the 
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Fig. 9.7. The direct correlation function, qC(q), versus qjo (a is the hard-core radius) 
for a Lennard-Jones fluid, obtained from a molecular dynamics simulation. Reprinted, 
by permission, from L. Verlet, Phys. Rev. 165, 201 (1968) 


Percus-Yevick equation 

gPY{qn) = e 0Viqn) + neP v( - qn) j dq 3 g/>r(?i3)(l - (9.87) 


where h PY {q ) = gpy{q ) - 1. 

Watts [15] has used the Percus-Yevick equation to compute the equation of 
state for a fluid whose particles interact via the Lennard-Jones potential. In 
Fig. 9.8 we show his results for a number of isotherms. The Percus-Yevick 
equation does predict the phase transition from the liquid to vapor phase. There 
is a large region where the Percus-Yevick equation has no solution. This region 
separates different branches of each isotherm of the equation of state and 
corresponds to the coexistence region. 

Wertheim [16] and Thiele [17] independently showed that the Percus-Yevick 
equation for a hard-sphere fluid can be solved analytically. Since the radial 
distribution function obtained from the Percus-Yevick equation is approximate, 
we do not expect the pressure and compressibility equations to give the same 
results for the equation of state. Thiele obtained the following equation of state 
from the pressure equation: 


„_ N u ^(l+^ + Sx 2 ) 

P ~V k ° T (I-,)’ ’ 

where x = | n. From the compressibility equation, he obtained 

_(l+£±ff) 

P ~— kBT (1-xf ■ 


(9.88) 


(9.89) 


In both equations the pressure becomes infinite at x = 1, which is a density 
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Fig. 9.8. Isotherms of the equation of state for a Lennard-Jones fluid obtained by 
solving the Percus-Yevick equation. Note that n* — no 3 and T* = ksT/e, where (1) 
T* = 1.2, (2) T* = 1.263, (3) T* — 1.275, and (4) T* = 1.3. The dots are experimental 
points for argon at T* = 1.3. For the region enclosed by the dashed line, the Percus- 
Yevick equation has no solutions (based on Ref. [10]). 


greater than the close-packed density, x cp = 27rv / 2/3. Thus, the equations give 
unphysical results for very dense systems. However, at lower densities they give 
good agreement with the results of molecular dynamics experiments [18]. 


► S9.C. Third Virial Coefficient [1, 3] 


The third virial coefficient contains the contribution from three-body clusters in 
the gas and may be written in the form 


HT) = - 


1 

3V 


' f 

dq l dq 2 dq 3 C/ 3 (qi,q 2 ,q 3 ) + 


yi 


dqidq 2 C/2(qi,q 2 )j • 
(9.90) 


Equation (9.90) is completely general and does not depend on the form of the 
interparticle potential. In deriving microscopic expressions for the cluster 
functions, we have assumed that the N-particle potential was additive — that is, 
that it could be written as the sum of strictly two-body interactions: 


( l/2)N(N-l ) 

V"(q") = Yl 

Vi) 


(9.91) 
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In fact, in a real gas this is not sufficient. For example, if three bodies in a gas 
interact simultaneously, they become polarized and an additional three-body 
polarization interaction occurs. This polarization interaction has a significant 
effect on the third virial coefficient at low temperatures and must be included 
[19]. Let us write the total three-body interaction in the form 

V 3 (q 3 ) = V(q 12 ) + V(q 13 ) + V(q 23 ) + AVi 2 3(q r ,q 2 ,q 3 ), (9.92) 

where Avi 2 3 is the three-body polarization interaction. The polarization 
interaction has been computed by several methods [20,21] and has the form 

Avi 23 = Q^ 1 +3c° s (7i)cos(72)cos(7 3 )) 

<?12tfl3tf23 

where are the lengths of the sides of the atomic triangle, 7, are the internal 
angles, and a is a parameter that depends on the polarizability and properties of 
the two-body interaction. Thus, the polarization interaction is repulsive. 

If we include the correction due to polarization effects, we can write the third 
virial coefficient as 


where 


B } (T) = + AB } {T), 

Bi (T) = — yy ]|| dq 2 dqy/(q i 2 )f(Mn)f(q 2 ^) 


and 


A B 3 (T) = - 


1 

3V 


I 


dqidq 2 dq 3 (e /3AVm — \)e ^( v » +v m+v») > 


(9.94) 

(9.95) 

(9.96) 


The third virial coefficients for additive potentials and the corrections due to 
nonadditive polarization effects have been obtained for a number of potentials. 
We shall discuss some of these results below. 


► S9.C.1. Square-Well Potential 

An expression for the additive third virial coefficient for the square- well 
potential was first obtained analytically by Kihara [1, 21]. For /?<2, it has the 
form 


B 3 (T)f* = \b 2 0 [ 5 - (. R 6 - 18i? 4 + 32 R 3 - 15)* 


- (2 R 6 - 36 R 4 + 32 R 3 + 18 /e 2 - 16)x 2 

— (6R 6 — 1 8/? 4 + 1 8/? 2 — 6)x 3 ], 


(9.97) 
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Fig. 9.9. The third virial coefficient, (a) The additive contribution as calculated for the 
Lennard- Jones 6-12 potential and the square- well potential for R = 1.6 and R = 2. (b) 
The polarization contribution for the potentials in (a). (Based on Ref. 10.) 


while for 2, it has the form 

*3 (T)% = \bl\5 - \lx - (-32K 3 + 18S 2 + 48)* 2 
- (5 R 6 - 32 R 3 + 18/? 2 + 26)jc 3 ] 

where x = [e^ £ — 1]. In computing the third virial coefficient, the values of R, s, 
and a obtained from the second virial coefficient are generally used. Sherwood 
and Prausnitz have plotted values of # 3 ( 7 ’)^ and the correction Ai? 3 (T) sw for 
a variety of values of R. Their results are shown in Fig. 9.9. At low temperature, 
the contribution from polarization effects can have a large effect. The corrected 
values give better agreement with experimental results than does the additive 
part alone. In Fig. 9.10 we compare the additive and corrected results with 
experimental values of Bs(T) for argon. 

► S9.C.2. Lennard-Jones 6-1 2 Potential 

The additive third virial coefficient for the Lennard-Jones potential can be 
obtained analytically in the form of a series expansion in a manner similar to 
that used for the second virial coefficient. However, it is much more 
complicated. The series expansion takes the form 

/ 1 \ — (*+l)/2 

B 3 (T)u = blJ20» ~ • (9.99) 

n = 0 ' / 



516 


INTERACTING FLUIDS 


Bz(T) (cc/mole) 2 



Fig. 9.10. Comparison of observed and calculated third virial coefficient for argon. The 
solid lines include the polarization effects. The dashed lines give only the additive 
contribution. (Based on Ref. 10.) 


A table of values for the coefficients f3 n may be found in Ref. 1 ; we will not give 
them here. Curves for B 3 {T)ff and AB 3 (T) U are shown in Fig. 9.9. They are 
very similar to the curves for the square-well potential. The Lennard-Jones 
curves for argon are compared with experimental results in Fig. 9.10. The 
Lennard-Jones potential does not give extremely good agreement with the 
experimental values for argon, probably because the potential well has 
insufficient curvature [10]. 


■ EXERCISE 9.3. Compute the third virial coefficient, B 3 {T), for a hard 
sphere gas of atoms of radius, R, confined to a box of volume, V. Use a 
geometrical method to find B 3 (T). Write the equation of state for this gas as 
a virial expansion to second order in the density. 


Answer: The third virial coefficient is written 


B 3 (T) = - 


1 

3V 


dq x dq 2 dq 3 f{q 2l )f{q 3l )f{q 32 ) 


( 1 ) 


First we change from coordinates (qi, q 2 , q 3 ) to coordinates (qi,q2i,q3i)> 
where q 2 i = q2 _ qi and q 31 = q 3 - q^ The Jacobian of this transforma- 
tions is one. We can integrate over q! to obtain 


* 3 ^ = 4 


dq2i<iq3i/(q2i)/(q3i)/(q3i ~ q2i)- 


( 2 ) 


Next make the change of variables, q = q3i - q2i and Q = 5^31 + q2i)- 
The Jacobian of this transformation is one. The third virial coefficient then 
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takes the form 


S3(T) = -3 


rfqdQ/lQ — i«l/(Q+^q)/(q)- (3) 


The integration over Q can be reduced to geometry. Consider the 
accompanying figure. 



The circle on the left represents the sphere Q < R and |q + ?QI< R. The 
sphere on the right represents the sphere jq - |Q| < R. The integration over 
Q in Eq. (3) gives the volume of the shaded overlap region. This volume is 
twice the volume of a spherical cap (for sphere of radius R) of height 
h = R — ^q. The volume of such a spherical cap is \nh 2 (3R — h). Thus, the 
integration over Q gives 


Bi(T) = 





(4) 


(there are two spherical caps). Integration over q gives 


B 3 (T) = 


5t t 2 R 6 
18 



The equation of state, to second order in the density, is 


L 


PV 

Nk B T 




5M 

8 



(5) 

( 6 ) 


► S9.D. Virial Coefficients for Quantum Gases [1, 22] 

For fluids composed of molecules of small mass, such as He, the classical 
expressions for the virial coefficients do not give very good results at lower 
temperatures (cf. Fig. 9.4). For such particles, the thermal wavelength 
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A 7 - = (27r h 2 /mk B T) 1 / 2 will be relatively large and quantum corrections must be 
taken into account. There are two kinds of quantum effects which must be 
considered: diffraction effects, which are important when the thermal 
wavelength is the size of the radius of the molecules, and the effects of 
statistics, which are important when the thermal wavelength is the size of the 
average distance between particles. 

To find general expressions for the virial coefficients, we proceed along lines 
similar to those for a classical fluid except that, for the quantum case, 
momentum variables no longer commute with position variables and cannot be 
eliminated immediately. The grand partition function for a quantum system can 
be written in the form 


00 R^'N 

Z,(T, V) = (9.100) 

n=o"‘ a t 

where X T is the thermal wavelength, 

W N (0) = X fN\e~ pk \ (9.101) 

and H N is the /V-body Hamiltonian; W N ((3) depends on the momentum and 
position operators for the N particles. As before, we can expand the grand 
partition function in a cumulant expansion, 


ZffJ, V) = exp 



e m 


Tr iUMU 


(9.102) 


where Ui{ff) depends on the momentum and position operators for / particles. If 
we equate coefficients of equal powers of the parameter e^' /X\ in Eqs. (9.100) 
and (9.102), we obtain 


Tr 1 [fir 1 (/3)]=Tr 1 [m (/?)], (9.103) 

Tr 2 [t/ 2 (/?)] = Tx 2 {W 2 (P)} - (Tr, [M^i (/3)]) 2 , (9.104) 

Tr 3 [t/ 3 (/?)j = Tr 3 [iy 3 (/9)] - 3 (Tr , [iV, (^)] ) (Tr 2 [ lV 2 (/3)] ) + 2(Tr,[Hr,(/3)]) 3 , 

(9.105) 


and so on. From Eq. (9.102) we can write the grand potential in the form 

_ 2 ° e Wl 

(1(V, T,fi) = -keT^—j, Tr, [£/,(/?)], (9.106) 

/= 1 1 - a t 

and the average particle density takes the form 
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where 

(9.108) 

The virial expansion for the equation of state may be written 

p OO 

— .'Emm®-', (9.109) 

where the virial coefficients, 08i(T), are related to the quantities bi(V,T) 
through Eqs. (9.45)-(9.48). Thus, 08\ = b\, 0S 2 = — ^3 = 4 b\ — 2 6 3 , and so 
on. In Exercises 9.4 and 9.5 we give some examples of quantum corrections to 
the classical virial coefficients. 


■ EXERCISE 9.4. Compute the second virial coefficient, B^T), for an 
ideal Fermi-Dirac gas of spin, S = |, particles. 

Answer: The second virial coefficient may be written 

Bl(T) = -2^Tr 2 [£>5(/3)] = -^Tr 2 [lV“(/3)] ^(Tr,^?)/?)]) 2 , 

( 1 ) 

where = 2\5’e~ /3 ^ 1+ ^ and W^((3) = , 7i = p?/2m is the 

kinetic energy operator for the ith particle. From Appendix B, Eq. (B.30) we 
can write 


Tri [ e -/?f, ] = EE (ki,5i|e 0T '\ki,si) = *E exp^— ^ = 2-^ 


Jt, ji=±i 

and 

Tr 2 [e~P^i+T2)} _ iEEEE 

ki ji=± 1 k2 i2=il 

[(ki , 51 ; k 2 , s 2 \e~^ +Tl) |k! , ; k 2 , s 2 ) 

- (ki , si ; k 2 , s 2 \e~^ Tx+I ' 2 ' ) \k 2 , s 2 \ ki , si)], 


T 

(2) 


= ?fE E < kl ^lk <m |kl>*l}') fE E < k 2' S 2l e 

Z V k, s,=±l / \ k 2 «-±l , 

4eeee (ki,5i|e |k 2 , 5 2 ) (k 2 , s 2 \e ^ r2 |ki,Ji) 


k] ij=±l k2 J2 = il 

1 4F 2 V 
2~Af 2 2 5 / 2 A| 


( 3 ) 
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If we combine the above equations, the second virial coefficient becomes 


B 


o 

2 


2V2‘ 


(4) 


Thus, Fermi-Dirac statistics causes an effective repulsion between particles 
in the gas. 


The only case that we can hope to treat analytically for an interacting 
quantum system is the second virial coefficient; because the Schrodinger 
equation can sometimes be solved analytically for two-body interactions. Below 
we show how that may be done. The second virial coefficient may be written 

Bi(T) = -~^ 2 [W 2 (D)] + (9.110) 

where = 2! A je~^ l+ ^ 2+ ^' 2 \Wi(/3) = A =pj/2m is the 

kinetic energy operator for the ith particle, and Vi 2 is the interaction potential 
between particles 1 and 2. We shall assume that the interaction potential 
depends on relative displacement, q = q! - q 2 , of the two particles and it may 
or may not depend on the spin of the two particles, V 12 = V(q,5'i,52)- It is 
useful to separate the kinetic energy into a center of mass and a relative part. If 
we introduce the center-of-mass momentum, P = + p 2 > and relative 

momentum, p = \ (p 2 - Pi), the total kinetic energy operator can be written 
T\ + T 2 = f cm + T re i, where t cm — (P 2 /4m) is the center-of-mass kinetic 
energy, and t re i = (p 2 /m) is the kinetic energy associated with the relative 
motion of the two particles. Let us consider the difference between the second 
virial coefficient for the interacting and noninteracting gas. We can write 


AB 2 = B 2 (T)-l4{T) = -^Tr 2 


e ~PT cm ^ e 0(Tr'l+Vn) _ e ' (9.111) 


The trace must be taken using symmetrized states for a Bose-Einstein gas and 
antisymmetrized states for a Fermi-Dirac gas. 

If we consider a spin zero boson gas, the symmetrized states are simply 


|k 1> k2) W =-^(|k,,k2> + |k 2 ,k 1 ». 


(9.112) 


and we can write 

ki k 2 


(9.113) 
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If the bosons have spin, then the states will also depend on spin and the 
expression for Tr 2 [W^/?)] becomes more complicated. 

Let us now consider a spin- ± fermion gas. We will let | denote s z = + i and 
let J, denote s z = - i. For each value of the momenta, ki and k 2 , there are four 
spin states that must be considered. The total spin of the two particles can be 
5 = 0 or 5 = 1. We let |ki,k 2 ;5,5' z )^ denote an antisymmetric state with spin 
S and z component of spin S z . Then the trace must be taken with respect to the 
states 


|k 1 ,k 2 ;0,0) W =l(|k,.k 2 ) + |k 2 ,ki»(| U> -| U», 
|ki,k 2 ;l,l) ( “ ) =^(|k 1 ,k 2 >-|k 2 ,k 1 ))|T,T>, 

|k,,k 2 ; 1,0) <*> =K|k,,k 2 > - |k 2) k,»(| T,i) + | U», 
|k,,k 2 ; l,-l) w =^(|k,,k 2 ) - |k 2 ,k,»| u). 


With these states, we can write 


ab 2 

k, k 2 (S,s z ) 

X W(k 1 ,k 2 ;5,5 z |[e-^(e"^ +v,2) -6"^01|ki,k 2 ;5,5 z ) (a) . 


(9.115) 


If the interaction between particles does not depend explicitly on spin — that 
is, if Vi 2 = V(q) — then the summation over spin can be performed easily and 
Eq. (9.115) reduces to 


At 


Afi2,/rf = -^^^[ W (ki;k 2 |[e 0Tcm {e - e l k i , k 2 > 


( J ) 


ki k 2 


where 


+ 3 (a) (ki,k 2 j \e~ 0fcm e -^ fr “ +Vn) - e-P'”')] |k t , k 2 ) (a) ], (9.1 16) 


(9.117) 


I k i ;k2 ) (a) = -^(| k l, k 2> - | k 2? k l))' 


The momentum eigenstates, |kj,k2), can be written in terms of center-of- 
toass momentum, P — hK, and relative momentum, p = hk. That is, 
|ki,k 2 ) = |K)jk). The symmetrized and antisymmetrized states become 

|k, .k2 )W =^ |K>(| k > + | -k >) and Ik,,^' = -L|K)(|k) - | - k», 


(9.118) 
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respectively. If we substitute Eqs. (9.118) into Eqs. (9.113) and (9.116) and 
perform the sum over the center-of-mass momentum (first change it to an 
integral), we find for the Bose-Einstein gas 


e -( 3 (T re i+V n ) _ (9.119) 


and for the Fermi-Dirac gas 

— 2 3 / 2 At 


AB2jd = 


^[ w (k|(e-^ +Vl2) - e-^ fr ‘ l )\k) {s) 


+ 3 {a) (k\(e 


-Pitm+Vn) _ e -/JT«i) |k)(fl) 


(9.120) 


Before we can proceed further, we must compute the energy eigenstates, 
| E n ), of the relative Hamiltonian, H re i = p 2 /m + V(q). Let us denote the 
position eigenstates by |r) so that q|r = r|r). Then the eigenvalue problem, 
H re i\E n ) = E n \E n ), in the position basis becomes 

— -Vfy„(r) + V(r)V>„(r) = EM r), (9.121) 

m 


where i/>„(r) = (rlE,,). The energy eigenstates form a complete set which we 
can denote, Yin I E n )(E n \ = 1. Similarly, we will let Yl n „ I E no )(E no \ = 1 denote 
the complete set of energy eigenstates of the relative kinetic energy operator, 
T re i. If the interaction potential is attractive, then the spectrum of H re i may 
contain contributions from bound states, in addition to states which extend over 
the size of the container. In the limit of infinite volume, the spectrum may 
consist of bound states in addition to a continuum of states. 

It is useful now to change to the position basis. If we also insert a complete 
set of energy eigenstates, we find 


A B 2 ,be = -2 3/2 \l I dr 




(9.122) 


and for the Fermi-Dirac gas we have 


A Bijd = — 2 3//2 A t 


dr 


^|^ } (r)| 2 e pEn PE * 


+ 3['52\' l Pn ) ( r )\ 2e PEn -^2\^n}{ r )\ 2e PEni 

\ n n 0 

where V , „K r ) =^(r|£' n ) and (r|£„). 


(9.123) 



SPECIAL TOPICS: VIRIAL COEFFICIENTS FOR QUANTUM GASES 


523 


Let us now restrict ourselves to spherically symmetric potentials, 
V(r) = V(jr|). The energy eigenstates may be written in the form 


00 / 


M r) = ./(r)n-(M), 


(9.124) 


1=0 m=—l 


where r = (r, 9, <j>) denotes the relative displacement in spherical coordinates, 
and Yi )m {0,4>) is a spherical harmonic. The coefficient, R„,/(r), satisfies the 
Schrodinger equation 


h 2 d 2 (rR n j(r)) 1(1+ \)h 


mr dr 2 

If we note that 


then 


+ 


mr z 


R ,j(r) + V(r)R„,,(r) = E n ,R nf (r). (9.125) 


00 / 

M~ r ) = EE (-l)'R „A')Yl,m(0,<t>), 

1=0 m =-l 


oo / 


+\r) = E 

v 2 z=0 OT= _ Z 


(9.126) 


(9.127) 


and 


00 / 


^n\- r ) = “^53 53 R n,2/+l(r)T 2 / + l,2m + l(^,</>). (9.128) 

v 2 z=:0 m=: _ z 

We can now substitute Eqs. (9.127) and (9.128) into Eqs. (9.122) and (9.123). If 
we use the orthonormality of the spherical harmonics 


r 2 t\ 


d(f) 


-1 


d(cos(0))Y^ ml (e,<t>)Y hm (e,<l>) = 


(9.129) 


and use the fact that the energy eigenstates must be normalized to one so that 
J 0 r*dr |R WjZ (r)| 2 = 1, we then find for the spin-0 Bose-Einstein gas 


AR 2> = -2 3/2 A 3 ]T(4/+l) 


1=0 


E- 8£ "“-E e ~pE„ 0 ,2l 


(9.130) 


and for the spin— i Fermi-Dirac gas 


A B 1Jd = -2 3 / 2 Aj jr 


/=o 


(4/+i ) 5>~ /?£, " 2 '-5> 


J£n 0 ,2/ 


«0 


+ 3(4/ + 3) ( *~ / * En ’ 2,+ ' - 53 e 


—fiEn 0 ,U+\ 


(9.131) 
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For the spin-0 Bose-Einstein gas, only even angular momentum states 
contribute to the second virial coefficient. For the spin-^ Fermi-Dirac gas, 
only even angular momentum states contribute for pairs of particles with total 
spin 5 = 0, and only odd angular momentum states contribute for pairs of 
particles with total spin 5=1. 

We can simplify the calculation of the second virial coefficients still 
further with a simple trick. Let us assume that the system is contained in 
a large spherical box of radius R and that the surface of the sphere has 
infinitely hard walls. The most general form of solution of Eq. (9.125) can be 
written 


R n ,i(r) = A n! i[cos(6 n ,i)ji(k n ,ir) - sin (6 n ,i)rii(k n jr)\, (9.132) 

where k n i = y/mE n i / h and 6 n j is the phase shift of the wavefunction relative to 
its value for the noninteracting system. For the noninteracting system, 6 n j = 0. 
The Bessel functions have the property that for large argument, 
lim x _o Ji(x) — ► (l/jc)sin[jc — In/ 2] and lim^oo/i/^) — ► —(1 /*)cos[x — /7r/2], 
Thus, near the outer walls of the container we have 


Rn,i(r) 


A n ,i 

k n ,ir 


sin 


k n ,ir 



for r ph R. 


(9.133) 


At the outer wall we must have /?„,/(/?) = 0. Therefore, for the interacting gas 
we must have 


k„ t iR - \ In + 6 nj i = n { n, (9. 1 34) 

and for the noninteracting gas we must have 

kn o jR - \ln = n Q jn, (9. 135) 

where n; and n 0 i are integers which label the allowed energy levels in the 
spherical container. If we take the derivative of Eq. (9.133) [Eq. (9.134)] with 
respect to k n j(k no j) and then take the difference of the resulting equations, we 
find 


1 d6 n ,i _ dni dn 0 j 
7T dk n j dk n i dk no / 


(9.136) 


This is useful if we note that in the limit as R — ► oo, the energy levels in the 
container form a continuum and we can write = I dk(dn/dk ), where dn/dk 
is the density of states. 

Let us again consider the virial coefficient for the spin-0 Bose-Einstein gas 
[Eq. (9.130)]. If we explicitly separate out the bound state contributions (and 



SPECIAL TOPICS: VIRIAL COEFFICIENTS FOR QUANTUM GASES 
suppress their dependence on /), we can write 
Afi2,i* = -2 3/2 A^e-^* 

dSii(k) 


n,bd 


2 3 / 2 Ar ^ 

— ^E (4,+1) 

" 1=0 


f oo 

dk- — -rr^-e 
q dk 


Similarly, we find for the spin-^ Fermi-Dirac gas 


A B 2Jd = - 2 V1 \\ e~^ M 

n,bd 

23/2A 'E(4/ + 1) 


7 r 


1=0 


d6 2 i(k) o _p#i n 

o d k 


+ 


' C 

3(4/ + 3) 
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dk 


d6 2 i+j(k) pp/ r 


dk 
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(9.137) 


(9.138) 


We cannot say anything more about the second virial coefficient, unless we 
specify the shape of the spherically symmetric potential, V(r). We leave this for 
an exercise. 


| ■ EXERCISE 9.5. Compute the second virial coefficient for a dilute gas of 
| hard sphere spin-0 bosons at fairly low temperature. 

! Answer: For hard spheres of radius R = a, in an infinitely large box, the 
| scattering phase shift, 6i(k ), is given by 

! tan(W)= S' (1) 

For a dilute gas of bosons at fairly low temperature, the momenta, k , of the 
i particles will be fairly low. Therefore, we can approximate the Bessel 
functions in Eq. (1) by their asymptotic limits for small argument, and we 
find 


(kn \ 2l+ 1 

tan ($,(*)) » ^ 7 . (2) 

(2/ + 1)(1 • 3 • • • (2/ — l)) 2 

Thus for small ka, the dominant term comes from the partial wave, l = 0, j 
and 6o(k) « —ka. Hard spheres have no bound states. Therefore, the second j 
virial coefficient can be written 


n d0 2? I2 \j 

B 2 = B" f L 


7T 


dkae-^' m = B° + la\\. 


(3) 
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The second virial coefficient for the noninteracting gas is B\ = —X\/2 5 / 2 . 
Thus, we find 

B 2 = ~ 2^2 4 ( 4 ) 

While Bose-Einstein statistics tends to lower the pressure of the boson gas, 
the hard core tends to increase it. 
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Problem 9.1. Compute the second virial coefficient for a gas which interacts via the 
potential 

{ oo if q < R, 

^(q-XR) if R^q^XR, 

0 if q > A R. 

Problem 9.2. Consider a classical gas in a box of volume V. Compute the second virial 
coefficient, B 2 , for the Gaussian model, /(^y) = e~ aq K where a' 1 ! 2 V 1 / 3 . Sketch the 

effective potential, /3V(qij). 

Problem 9.3. Compute the second coefficient for the weakly coupled particles with 
potential V(q) = V 0 for <7 < /? and V(q) = 0 for q > R. 

Problem S9.1. Assume at low density the direct correlation function can be 
approximated by C(q) = - 1). For a low-density hard- sphere gas (radius R), 

use the Omstein-Zemicke equation to obtain the first-order correction (in density) to the 
ideal gas expression for the isothermal compressibility. Show that your result agrees 
with the result obtained when starting from the virial expansion, P/nk B T = 1 + nB 2 , 
where n~N/V and B 2 is the second virial coefficient. 

Problem S9.2. Consider a classical gas in a box of volume V. Compute the third virial 
coefficient, B 3 , for the Gaussian model, /(^y) = e aq \ where cC 1//2 V 1//3 . 

Problem S9.3. The third virial coefficient can be written 

B-i = dq 13 f(q 12 )f(q n )f(q i3 - qi 2 ). 

where/(q) = (e _ ^ v fa) — 1 ) and V(q) is the interaction potential between particles with 
relative position, q. The Fourier transform of /( q) is /( k) = JV k q /(q). The inverse 
transform is/(q) = (l/27r) 3 f e -ikq /(k). (a) Show that the third virial coefficient can 
be written B 3 = - A (1 /2tt ) 3 J dkf(k) /(-k) /(-k). (b) Compute /( k) for a gas of hard 
spheres with radius R. (c) Use part (b) to compute the third virial coefficient for a hard 
sphere gas. 

Problem S9.4. Compute the third virial coefficient for weakly coupled particles with 
potential V(q) = Vq for q < R and V(q) =0 for q > R. 

Problem S9.5. Compute the third virial coefficient for an ideal Fermi-Dirac gas of spin, 
s = 5 , particles. 

Problem S9.6. Compute the second virial coefficient for a dilute gas of soft-sphere spin- 
0 bosons at fairly low temperature. Assume that the spheres have radius r = a and a 
potential V(r) = +Vo for r < a and W(r) = 0 for r > a. Assume also that 
k 2 < mVo/h 2 . 




PART FOUR 


NONEQUILIBRIUM STATISTICAL 
MECHANICS 




hydrodynamic processes near 
equilibrium 


10.A. INTRODUCTION 

When a system is disturbed from its equilibrium state, quantities which are not 
conserved during collisions, or which are not order parameters for an 
underlying broken symmetry, decay rapidly to their equilibrium values. After 
a few collision times, only quantities which are conserved during the collisions, 
or order parameters, remain out of equilibrium. The densities of conserved 
quantities and order parameters entirely characterize the nonequilibrium 
behavior of the fluid after long times. The equations of motion for the 
densities of the conserved quantities and for order parameters are called 
the hydrodynamic equations. Some examples of conserved quantities 
may include particle number, momentum, and energy. Examples of order 
parameters may include average magnetization, or a complex function 
characterizing a superfluid state. If there are inhomogeneities in the densities 
of conserved quantities, then particles, momentum, or kinetic energy must be 
transported from one part of the fluid to another to achieve equilibrium. 
Therefore, very-long-wavelength disturbances will take a long time relax, 
whereas short-wavelength disturbances relax more quickly. This dependence of 
relaxation time on wavelength is a feature that characterizes hydrodynamic 
processes. 

Hydrodynamic equations describe the long-wavelength, low-frequency 
phenomena in a large variety of systems, including dilute gases, liquids, solids, 
liquid crystals, superfluids, and chemically reacting systems. For complicated 
systems, transport processes are often coupled. For example, in a multi- 
component system, it is possible to have a temperature gradient drive a particle 
current and a concentration gradient drive a heat current. Some complicated 
systems can have as many as 10 or 20 (or more) transport coefficients to 
describe the decay to equilibrium from the hydrodynamic regime. 

In 1932, Onsager showed that the reversibility of the dynamical laws on the 
microscopic level requires that certain relations exist between transport 
coefficients describing coupled transport processes. Onsager’s relations are of 
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immense importance because they enable us to link seemingly independent 
transport processes and thereby reduce the number of experiments that must be 
performed in order to measure all the transport coefficients. In this chapter we 
will derive Onsager’s relations and apply them to transport processes in reacting 
multicomponent systems and superfluid systems. At the same time we will 
show how the hydrodynamic equations for such complicated systems can be 
derived from a knowledge of the thermodynamics and symmetry properties of a 
system. 

Fluctuations about the equilibrium state decay on the average according to 
the same linear macroscopic laws (hydrodynamic equations) which describe the 
decay of the system from a nonequilibrium state to the equilibrium state. If we 
can probe the equilibrium fluctuations, we have a means of probing the 
transport processes in the system. The fluctuation-dissipation theorem shows 
that it is possible to probe the equilibrium fluctuations by applying a weak 
external field which couples to particles in the medium but yet is too weak to 
affect the medium. The system will respond to the field and absorb energy from 
it in a manner which depends entirely on the spectrum of the equilibrium 
fluctuations. According to the fluctuation-dissipation theorem, the spectrum of 
equilibrium fluctuations and the rate of absorption of energy from the external 
field can be expressed in terms of a response matrix which is related to the 
correlation matrix for equilibrium fluctuations. 

In this chapter we will derive the fluctuation-dissipation theorem and apply 
it to a variety of systems. The derivation requires several steps. We first 
introduce the Wiener-Khinchine theorem , which enables us to relate the spectral 
density matrix for fluctuations to the time-dependent correlation matrix for 
fluctuations in the system. We then introduce linear response theory and use the 
assumption of causality to obtain a relation between the real and imaginary 
parts of the response matrix. 

One of the simplest applications of linear response theory involves a 
harmonically bound Brownian particle immersed in a medium. If the Brownian 
particle is pulled away from its equilibrium position, it will decay back to 
equilibrium and dissipate energy into the fluid. We will obtain an expression for 
the linear response function of the Brownian particle and in terms of it derive an 
expression for the correlation function for equilibrium fluctuations in the 
position for the oscillator. 

In the special topics section we discuss important applications of the 
fluctuation^dissipation theory — for example, the scattering of light from a fluid. 
The electric field of the incident light wave polarizes the particles in the 
medium, thus allowing the light to couple to the medium. The light will be 
scattered by density fluctuations, and by measuring the spectrum of the 
scattered light we can measure the spectrum of the density fluctuations. We will 
find that the density fluctuations are of two types: thermal density fluctuations 
due to fluctuations in the local entropy and mechanical density fluctuations due 
to damped sound waves. For low-frequency and long-wavelength fluctuations, 
the spectrum of scattered light can be obtained from the linearized hydro- 
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dynamic equations, and, therefore, light scattering experiments give us a means 
of measuring transport coefficients in the fluid. 

In the special topics section we shall apply hydrodynamic theory to transport 
processes in electric circuits composed of different metals coupled together and 
held at nonuniform temperature. We shall also discuss the transport of mixtures 
across membranes. 

The hydrodynamic equations describe the long time behavior of a few 
hydrodynamic modes in a system with 10 23 degrees of freedom. The non- 
hydrodynamic degrees of freedom decay on a much faster time scale than the 
hydrodynamic modes and provide background noise. We can use the 
fluctuation-dissipation theorem to find the correlation functions for this 
background noise as we will show in the special topics section. 

It is also possible to derive correlation functions for microscopic Brownian 
particles using the hydrodynamic equations. The hydrodynamic flow of the 
medium around a Brownian particle creates memory effects which cause its 
velocity autocorrelation to decay with a slow long time tail. 

The poles of the spectral density matrix give us information about the 
spectrum of fluctuations in an equilibrium system. The poles corresponding to 
very-low-frequency, long-wavelength processes are due to hydrodynamic 
modes in the system. By using projection operator techniques, it is possible 
to introduce projections onto the space of hydrodynamic modes and onto the 
space orthogonal to it. We can then write the spectral density matrix for 
hydrodynamic fluctuations in a very general form and show that hydrodynamic 
modes arise not only from conserved quantities but also from broken 
symmetries which occur at a continous phase transition. 


10.B. NAVIER-STOKES HYDRODYNAMIC 
EQUATIONS [1-3] 

The Navier-Stokes equations describe the macroscopic behavior of an isotropic 
fluid of point particles out of quilibrium. They are essentially the macroscopic 
“balance equations” (cf. Appendix A) for the quantities that are conserved 
during collision processes on the microscopic scale. The conserved quantities 
for an isotropic fluid of point particles include the particle number, the 
momentum, and the energy. The balance equations for the conserved quantities 
contain no source terms because the conserved quantities cannot be created or 
destroyed on the microscopic scale. In addition to the balance equations for 
conserved quantities, it is essential to write the balance equation for the entropy 
density of the fluid. Entropy is not a conserved quantity. For a fluid in which 
irreversible processes can occur, there will be an entropy source term. The 
entropy source term in a fluid is the hydrodynamic equivalent of the Joule 
heating which occurs in an electric circuit which has resistance. The entropy 
source term enables us to identify generalized forces and currents in the fluid. 
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The conductance in a fluid (the proportioality constant between force and 
resulting current) is called a transport coefficient. Once the transport 
coefficients for the fluid have been identified, we can write the Navier-Stokes 
equations. 


10.B.1. Balance Equations 

In this section we will derive the balance equations for the mass density, 
momentum density, energy density, and entropy density for an isotropic fluid of 
point particles. The balance equation for the mass density is especially simple 
since particles can’t be created and there can be no dissipative particle currents. 
The balance equation for the momentum density is based on Newton’s second 
law. Momentum can only be created in a fluid of external unbalanced forces act 
on the fluid, and therefore there can only be a momentum source term if 
external forces are present. The balance equations for the energy density and the 
entropy density can be written in a general form, and thermodynamic relations 
can be used to relate them. 


10.B.1.1. Mass Balance Equation 

In the absence of chemical reactions, the number of particles entering and 
leaving a collision in a fluid will be conserved. For nonrelativistic processes, the 
total mass of the particles involved in the collision process will also be 
conserved. As a result, if we consider a volume element of the fluid, V(t ) (with 
a given set of fluid particles), which moves with the fluid, the amount of mass 
inside this volume element must remain constant. Let p = p{ r, t) denote the 
mass density (mass per unit volume) and let M denote the total mass in the 
volume, V(t). Then 


dM 

~dl 


d_ 

dt 


V(t) 


pdV = 


v{t) 


(dp 

\dt 


+ pv r • v ]dV = 0, 


( 10 . 1 ) 


where v = v(r, t) is the average velocity of the fluid at point r and time t, and 
we have used Eq. (A.16) in Appendix A. Since the volume element, F(t), is 
arbitrary, the integrand must be zero and we find 


+ pV r • v = 0. (10.2) 

dt 

If we note that the convective derivative is given by d/dt = d/dt + v • V r , then 
we can also write 


dp 

dt 


+ V r • (py) = 0, 


(10.3) 


The quantity, J = p\ is the mass current or mass flux and has units mass/ 
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area time. It is also a momentum density. The derivative, dp/ dt , gives the time 
rate of change of the mass density for an observer moving with the fluid. The 
derivative, dp/ dt, gives the time rate of change of the mass density for an 
observer at a fixed point in the fluid (cf. Appendix A). Equation (10.3) is 
sometimes called the continuity equation. It is a direct consequence of the 
conservation of mass in the fluid. 


10.B.1.2. Momentum Balance Equation 

The total momentum, P(t) = p \dV, of the volume element, V(t), evolves 
acording to Newton’s law. The time rate of change of the momentum, P(t), must 
be equal to the sum of the forces acting on the volume element, F(f). Therefore 
we can write the equation of motion of the fluid element in the form 


dP(t) 

dt 



(10.4) 


where F is an external force per unit mass which couples to the particles inside 
the volume element (it could be an electric or magnetic field for example), f is a 
force per unit area acting on the walls of the volume element, and S(t) denotes 
the surface of the volume element, V{t). The surface force, f, is due to the fluid 
surrounding the volume element. It will always have a component 
perpendicular to the surface of V(t), and for a nonideal fluid (a fluid with 
dissipation) it will have a component tangent to the surface. 

If we write a differential surface area element as a vector, dS, directed 
outward perpendicular to the surface, then we can write fdS— dS • P, where P is 
the pressure tensor, f = n • P, and n is a unit vector directed outward 
perpendicular to the surface. The pressure tensor has nine components. In 

Cartesian coordinates it can be written P = P^xx + P*yXy + b P a zz, where 

x, y, and z are unit vectors in the x, y, and z directions, respectively. The unit 
vector, n, can be written n = n x x + n y y + n z i, where n x , n y , and n z are 
components of n in the x,y, and z directions, respectively. Note that the ith 
component of the vector, f, can be written njPji, where i = x,y,z and 

j = x,y,z. If we use Gauss’s theorem, we can write 

dS-P=[ dVVr- P (10.5) 

Js(0 Jv(r) 

Here V r • P is a vector those ith component is (V r • P) i = Ylj ®jPji ~ 
dP X i/dx + dPyi/dy + dP Z i/dz and d z = d/dx, d y = d/dy , and d z = d/dz. Then 
the argument of Eq. (10.4) must satisfy the equation 

^ + pv(V r -v) = pF + V r -P 
dt 

I c f. Appendix A, Eq. (A. 16)]. 


( 10 . 6 ) 
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For an ideal fluid (no dissipation) the only force on the walls of V(t) is due to 
the hydrostatic pressure, P = P( r, t), which is always perpendicular to the walls 
and pointed inward. Thus, for an ideal fluid we have f = — Ph and P = — PU(U 
is the unit tensor, U = xx + yy + zz). For a nonideal fluid, there is also a 
dissipative contribution, A, to the pressure tensor and Eq. (10.6) takes the form 

~ + pv(V r • v) = pF - V r P - V r • A (10.7) 

since V r • (PU) = V r P. The tensor A is called the stress tensor. If we make use 
of the convective derivative, d/dt = d/dt + v • V, we can also write Eq. (10.7) 
in the form 


-i- + V r • (PU + pvv + A) = pF. (10.8) 

ot 

The term pvv is the momentum flux. It is a nine-component dyatic tensor, 

pvv = pv x v x xx + pv x v y x y H 1- pv z v z zz. Equations (10.7) and (10.8) are 

alternative versions of the momentum balance equation for an isotropic fluid 
of point particles. Equation (10.7) describes the time rate of change of 
momentum density as seen by an observer moving with the fluid, and Eq. (10.8) 
describes the time rate of change of momentum density a seen by an observer at 
a fixed point in space. 

10. B. 1.3. Energy and Entropy Balance Equations 

The energy and entropy balance equations are easy to write down. We will let e 
denote the energy per unit mass, and let pe denote the energy per unit volume of 
the fluid. For the case when the external force has the form F = — V r ^>(0 is a 
potential energy per unit mass), the energy density can be written pe = 
pu + \ pv 2 + pq b, where u is the internal energy per unit mass. Thus the energy 
density has contributions from thermodynamic processes in the fluid, the kinetic 
energy of the fluid, and the external potential energy. 

Since the energy inside the volume element cannot be created or destroyed 
by collisions, the only way the energy inside the volume element can change is 
by flow through the walls of V(t). The balance equation for the total energy can 
be written 

^ + V r -(J* + Jf) = 0, (10.9) 

where l R e is the reactive (nondissipative) energy current and jf is the dissipative 
energy current. Both jf and jf have yet to be determined. 

The entropy balance equation can be written in an analogous fashion. 
However, now we must take account of the fact that entropy can be created 
inside the volume element due to spontaneous processes inside of V(t). Let s 
denote the entropy per unit mass (the specific entropy) and ps denote the 
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of + v r • (Jf + (Jf ) = &s, (10.10) 

where jf is the reactive entropy current, jf is the dissipative entropy current, 
and cr s is an entropy source term due to disspative processes inside of V(f). The 
explicit forms of jf , jf , and cr s will be determined in the next section. 

10.B.2. Entropy Source and Entropy Current 

Let us now assume that spatial variations of deviations from thermodynamic 
equilibrium occur on very large length scales compared to microscopic 
distances. Then locally the system will be in equilibrium, but the actual values 
of thermodynamic quantities vary slowly in space and time. The fundamental 
equation for the internal energy is pu = Tps — P + pp, where p is the chemical 
potential per unit mass [cf. Eq. (2.66)]. If we take the differential of this 
equation and use the Gibbs-Duhem equation, pdp + psdT — dP = 0, we find 
d(pu) — Td(ps) + pdp. Therefore, we can write 

dpu dps _dp _dpu dps _ dp 

dt dt ^ dt dt dt ^ dt (10.11) 

+ V • V r {pu) - T\ • V r (ps) ~ p\ • V r (p) = 0, 

where we have made use of the convective time derivative. Let us next note 
that 


V • Vr(/0M) - T\ • V r (ps) - p\ ■ V r (p) 

= V r • [(/ pu - Tps - pp)\] + psy ■ V r T + p\ ■ V T p + -PV r • v (10.12) 
= V • ( psV r T + pVrp - VrF) = 0, 

where we have used the fundamental equation, pu = T ps — P + pp, and the last 
term is zero because of the Gibbs-Duhem equation, pdp + psdT — dP = 0. 
Therefore 


dpu _ T dps_ _ _dp _ dpu _ T dps _ ~dp_ n 
dt dt ^ dt dt dt ^ dt 


(10.13) 


Let us now assume, for simplicity, that all external forces are zero so that 
F - 0. Then the total energy per unit volume is pe = pu + \ pv 2 , and we can 
write 


dt dt dt 2 dt 


(10.14) 
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If we combine Eqs. (10.13) and (10.14), we find 


dps _dpe _ dp\ f 1 2 \ dp 

T dt dt dt ~\2 V dt 


(10.15) 


We can now use the balance equations to obtain an expression for the 
entropy current and source. If we combine Eqs. (10.3), (10.8), (10.9), and 

(10.15), we obtain 

= _Vr ' (J " + J ° ) + v[Vr ■ (/>0 + pyy + n)1 ~{b 2 ~ Vr ' {py) ■ 

(10.16) 

This equation can be simplified further. Note that 

V • [V r • (pw)] Vi^jipVjVi) = X][/0 v « v ;( V ; v «) + V f Vj{pVj)}. 

i J i j 

(10.17) 

Note also that 

^j(p v j\ v f) = ^vjVjipvj) + pvjViVjVi. (10.18) 

If we combine Eqs. (10.17) and (10.18), we find 

V • [V r • (pw)] = ~V 2 V r • (pv) + ^ V r • (pv 2 v). (10.19) 

Let us now substitute Eq. (10.19) into Eq. (10.16) and make use of the Gibbs- 
Duhem relation, psV r T + pV r p — V r P = 0. Then after some rearrangement 
we find 


T-jjf = -v r • (j; + jf - pp"v - n • v) + psy • v r r - n .- v r v, ( 10 . 20 ) 

where fl : V r v = 5^. ^ Ily-ViV,- anc * P" ~ A + 2 y2 - We can now write Eq. 
(10.20) in the form of a balance equation. If we divide through by the 
temperature, we can rearrange terms to obtain 

/ jf+jf- w "v-n-v 

dt Vr v T 

+ (Jf + Jf - w"v - psTv - n • v) • V r (i) - In : V r v. 
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We can simplify Eq. (10.21) still further if we first write its form for a 
nondissipative fluid. 

Consider a fluid in which no dissipative processes occur. By definition, the 
stress tensor, II, equals 0 and the dissipative energy current, jf , equals 0 for 
such a fluid. Also there will be no entropy source term. Therefore, from 
Eq. (10.21) the reactive energy current must be given by 

jf = pp!'\ + psT\ = p(h + ^v 2 )v = p(u + P + ^v 2 )v, (10.22) 

where h is the enthalpy per unit mass. If we substitute Eq. (10.22) into 
Eq.( 10.21) and set fl = 0 and jf = 0, we obtain the entropy balance equation 
for a nondissipative fluid, 


dps 

dt 


-V r • (pS\). 


(10.23) 


The entropy current in a nondissipative fluid is ps\. 

If we substitute the definition of the reactive energy current, jf, into Eq. 
(10.20), we obtain the final form of the entropy balance equation for a 
dissipative fluid: 

~ = -V r ■ (psy + Jf) - i Jf • V r T - in : V r v, (10.24) 

where the dissipative entropy current, Jf , is 

Jf = i(jf-n-v). (10.25) 

The entropy source in a dissipative fluid is 

<7 s = -ijf-V r r-in:V r v. (10.26) 


One can easily check that the quantity jf • V r T + fl : V r v has units (energy/ 
volume • time). It is the hydrodynamic version of Joule heating. In an electric 
circuit, Joule heating is given by J • E, where E is the electric field and J is the 
electric current. This is the (energy/volume • time) dissipated in the electric 
circuit. We see that the hydrodynamic entropy production has the same form if 
w e identify Jf and II as the generalized currents and V r T and V r v as the 
generalized forces driving those currents. 

In an electric circuit the electric current and the electric field (driving force) 
^e related by Ohm’s law, J = crE, where a is the conductivity of the electric 
Medium. Hydrodynamic media also have a version of Ohm’s law in which the 
driving forces are linearly related to the currents. In hydrodynamic systems the 
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“conductivities” are called “transport coefficients.” In the next section we 
will obtain the transport coefficients for an isotropic fluid of point particles, 
and in so doing we will be able to write the Navier-Stokes equations for the 
fluid. 


■ EXERCISE (10.1) (a) Write the dyatic tensor V r v in orthogonal 
curvilinear coordinates. Assume that the coordinates have unit vectors ei, e 2 , 
and 63, coordinates mi, u 2 , and M3, and scale factors h\,fi2, and I13. (b) Write 
the dyatic tensor V r v for the special case of cylindrical coordinates 
mi = r, M2 = 0, and M3 = z with scale factors h\ = 1, /12 = r, and h 3 = 1 and 
unit vectors §1 = r,e2 = <j>, and 63 = z. 

Answer 


(a) In orthogonal curvilinear cordinates the gradient is 

_ ei d e 2 d e 3 d 

V = — 1-— — 

h\ du\ /12 d u 2 hi duj ’ 


(1) 


and the velocity can be written v = v^i + v 2 e2 + v 3 e3. Let jc, denote 
Cartesian coordinates, where jci = x, X 2 = y, and x 3 = z. Unit vectors 
in Cartesian coordinates will be denoted x a with a = 1,2,3. 
(Cartesian coordinates are orthogonal curvilinear coordinates with 
all scale factors equal to one.) In evaluating the dyatic, it is useful to 
note that derivatives of the Cartesian unit vectors x a with respect to 
the generalized coordinates mi,M 2, and M3 are zero. That is, 
dx a /dui = 0. The unit vectors e, are related to the Cartesian unit 
vectors through a transformation 

e< = 53 Ri,a( u 1 , «2, «3)x a and x a = ^3 R a} ( M i > M 2, M3 )e f . (2) 

a i 


We can now make use of the above relations to write the dyatic tensor, 
V r v in the form 


e,e ; - dvj .. d R j,<* D -i 




hi Out . 

i J i j k a 

(b) In cylindrical coordinates we have 


r \ / cos (4>) sin(0) 0 N / x 

4> = — sin(^) cos(0) 0 y 
z/ \ 0 0 1 / \ z 


r; 


ht } d Ui a ' k ' 


(3) 


(4) 
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10.B.3. Transport Coefficients 

Transport coefficients are the generalized conductivities of a hydrodynamic 
system. Before we can determine how many transport coefficients are needed 
for our isotropic fluid of point particles, we must determine how many 
independent driving forces and currents there are in the fluid. In the last section 
we found that the generalized driving forces are V r T and V r v and the 
generalized currents are jf and ft. Both V r v and fi are nine component 
tensors, and each can be decomposed into three orthogonal parts. For example, 
we can write the dyatic, V r v, in the form (see the insert “Useful Mathematics — 
Tensors”, below) 


V r v = i (Vr . v)U + [Vrv]' 5 + [V r v] fl . (10.27) 

In Cartesian coordinates, we have 


[V r v]J = I (djv, + d,Vj) - ±4;V r • v and [V r v]“ = I (% - a,vj). (10.28) 


Similarly, we can write the stress tensor as 

n = f [Tr(n)]u + n s + n°. (10.28) 

For an isotropic fluid of point particles or particles which interact via a 
spherically symmetric potential, the antisymmetric part of the stress tensor is 
identically zero. The antisymmetric “force,” [V r v] a = V r x v, is the curl of the 
velocity. It is the contribution from vortex motion in the fluid. For a fluid of 
particles which interact via spherically symmetric forces, there is no way to 
couple rotational motion of the particles to vortex motion of the fluid. 
Therefore, there is no way to dissipate energy from the fluid flow by 
transforming angular momentum of the fluid into angular momentum of the 
particles. Thus, for a fluid of point particles, IT = 0. For a fluid of particles 
w hich interact via nonspherically symmetric forces, there will be coupling 
between angular momentum and translational flow [4]. 
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■ Useful Mathematics —Tensors 

An arbitrary tensor, T, can be decomposed into three orthogonal 
components. In cartesian coordinates we can write the nine component 
tensor in the form 

T = T^xx + T^xy + • ■ • + T a xz. (1) 

The nine terms in Eq. (1) can be regrouped into three orthogonal quantities. 
We may write T as 

T = ±Tr(T)U + T s + T fl , (2) 

where U = xx + yy + zz is the unit tensor and Tr(T) = T xx -\- T yy + T a . In 
Eq. (2), T * is a symmetric tensor with zero trace and is defined as 

T s = ±(T + T r )-iTr(T)U, (3) 

where T r is the transpose of T. The symmetric tensor may also be written in 
the form 

T s = T s XJC xx + T s xy xy + --- + T s zz zz, (4) 

where T = \ T„ - 1 T„ - 1 T a , \ (T„ + T, x ), and so on. In Eq. (2), 
T a is an antisymmetric tensor with zero trace and is defined as 

j« = i(X_T 7 ’). (5) 

The antisymmetric tensor may be written in the form 

t a = T a xx xx + T^xy + zz, (6) 

where T a ^ x = 0,T a xy =\{T xy - T yx ), L and so on. 

If we introduce a second tensor, V, then it also can be decomposed in the 
manner described above. The scalar product of the two tensors can be 
written 

T . V = i[Tr(T)][Tr(V)] + T s : \ s + T a : \ a . (7) 

One can also show that 

U:T i =U:T a = T" :V a = 0. (8) 

Thus, the tensor, T, can be decomposed into three independent orthogonal 
components. 
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In order to write “Ohm’s law” for the fluid, we shall make use of Curie’s 
principle, which states that in an isotropic fluid, a given force cannot drive a 
current of a different tensor character. Let us note that there are four forces in 
the fluid, each with a different tensor character. They are V r T, a polar vector; 
[V r v] 5 , a symmetric tensor; V r • v, a scalar: and [V r v] a = V r x v, an axial 
vector. (Polar vectors and axial vectors behave differently under inversion 
through the origin of coordinates. Polar vectors change their sign. Axial vectors 
do not.) Since the antisymmetric part of the stress tensor is zero, the axial vector 
cannot drive a current. Thus, an isotropic fluid of point particles has three 
generalized forces. They are V r 7\ V r • v, and [V r v] s . They drive the general- 
ized currents, Jf,II = jTr(fi), and fP. Since they are all of different tensor 
character, we find the following generalization of Ohm’s law for the fluid: 

J? = -fv,7\ (10.29) 

where K is the coefficient of thermal conductivity, 

= —2 r?[V r v] s , (10.30) 

where rj is the coefficient of shear viscosity, and 

n = -CV r -v, (10.31) 

where £ is the coefficient of bulk viscosity. 

If we use Eqs. (10.29)-(10.31), the entropy production takes the form 

|v r 7f + 2| |[V r v]f + | |V r ■ v| 2 , (10.32) 

Where |V r 7f = (V r T) • (V r T) and j[V r v] s | 2 = [V r v] s : [V r v] s . The mass 
balance, momentum balance, and entropy balance equations take the form 

^ + V r (pv)=0, (10.33) 

fjf + V r • (pvv) = -V r P + r;Vpv + (( + \n) V r (V, • v). (10.34) 

and 

d -§ + V r • (psv + Jf ) = £ \ VrT-p + 2| |[V„ v]f + i (V, • v) 2 . (10.35) 

respectively. The derivation of the specific form of Eq. (10.34) is given in 
Exercise (10.2). Equations (10.33)-(10.35) collectively are called the Navier- 
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Stokes equations or hydrodynamic equations. They describe the hydrodynamic 
behavior of an isotropic fluid of point particles. In the next section we describe 
how to solve the linearized version of these equations. The linearized 
hydrodynamic equations describe the behavior of the fluid if it is only slightly 
displaced from absolute equilibrium. They are also useful in describing the 
behavior of fluctuations for a fluid in equilibrium. 


■ EXERCISE 10.2. Prove that 

V r ■ n = — pVjv - (C + ^)V r (Vr ■ V) 

if IF = — 2 t 7 [V,.v] s and II = — £(V P • v) with fi = IIU + fF. Note that U is 
the unit tensor. 

Answer: The stress tensor is fl = —2r)[V r \} s — CU(V r • v), where 

[VrV]J = Jftv, + *») - &V r ■ V. (1) 

The f/th component of the stress tensor is 

fly = -T)(djVi + diVj) + (§77 ~ C)<5yV r • v. ( 2 ) 

Then 

(Vr • n ) ij = -di[r)(djVi + diVj) - (fi? - C)<5yV r • v] 

= ~[r]dj{diVi) + rjdfvj - %q - QdjV r • v] (3) 

= ~r}dfvj ~{\q + QdjVr ■ v. 

Thus 

V r • n = - 7 )Vjv - (C + 5 ^)V r (V r • v). (4) 


10.C. LINEARIZED HYDRODYNAMIC EQUATIONS [5] 

The hydrodynamic equations, (10,33)— (10.35), depend nonlinearly on the 
thermodynamic variables and the average velocity. This makes them very 
difficult to solve and one generally must resort to numerical methods for solving 
them. However, if we restrict ourselves to the neighborhood of absolute 
equilibrium, then we can look at the behavior of small deviations from the 
equilibrium state, and it is possible to linearize the hydrodynamic equations. 
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10.C.1. Linearization of the Hydrodynamic Equations 

Let po, To, so, and P 0 , denote the equilibrium mass density, temperature, the 
specific entropy, and pressure, respectively. Then close to the equilibrium state, 
we can write p( r, t) = p 0 + Ap(r, t), T = T 0 + AT(r, t), .s(r, t) = s Q + As(r, t), 
and P( r, t) = Po + AP(r, /), respectively. The average velocity, v(r, t), is zero 
at equilibrium, so it is already first order in deviation from equilibrium. If we 
now substitute these expansions into the hydrodynamic equations (10.33)- 
(10.35), and only retain terms to first order in deviations from equilibrium, we 
find 


^ + p„V r -v = 0, (10.36) 

fl0~ = -VrA P + + (C + h) V r(Vr • V), (10.37) 


and 


*if=t 0 v2 ' at - (1038) 

To obtain the final form of Eq. (10.38), we have used Eq. (10.36) to eliminate 
two terms and we have used Eq. (10.29). 

The momentum balance equation, (10.37), is actually three equations since 
the velocity has three components. Thus, we have five equations but we have 
seven unknowns, Ap, As, AT, AP,\ x ,\ y , and v z . In order to solve these 
equations, we can use thermodynamics to reduce the number of unknowns to 
five since the quantities Ap, As, AT, and AP are related by thermodynamic 
equations. We can choose two of them to be independent and expand the other 
two in terms of them. The choice of the two independent variables depends on 
convenience for the problem of interest. 

Let us choose Ap and AT to be independent. Then we can write 

As(r, t) = g)°A p{v,t) + ^V(r,i) (10.39) 


and 


/ QP\ 0 TdP\ 0 

= fc) r Mr ’ ()+ {wf) AT ^- 


(10.40) 
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The linearized hydrodynamic equations take the form 


dAp 

dt 


+ PoV r • V = 0, 


(10.41) 


d\ /dP\ / dP\ 

pQ ~dt = ~\dp) r VrAp “‘Vary VrAr + ^ V r v + (C + h)v T (y r • v), 

(10.42) 

and 


Po 


ds\° dAp / ds\°dAT 
dp) T ~dT + Po [df) ~dT 


^v?Ar. 

T 0 r 


(10.43) 


Equations ( 1 0.4 1)-( 10.43) form a set of coupled equations which describe the 
behavior of small deviations from absolute equilibrium. 

It is useful to consider the behavior of the linearized hydrodynamic equations 
for an ideal fluid in which the transport coefficients are K = 0, rj = 0, and 
C = 0. Then Eq. (10.43) gives 


dAT _ / dT\°/ds\°dAp fdT\°dAp 

dt \ ds ) p \Pp)t dt ~ \ d p)s dt ’ 

and Eq. (10.42) reduces to 


dx 


Po dt 




o 

VrAr. 

p 


(10.44) 


(10.45) 


If we now take the time derivative of Eq. (10.45) and use Eqs. (10.41) and 
(10.44), we obtain 


d 2 x 

w 


fdP \°_ _ 2 
( — l V 2 v — 0 

\0p). r ' 


(10.46) 


where we have used the fact that (dP/dp)° s = (dP/dp)° T + (dP /dT)° p (dT jdp) 0 s - 
Equation (10.46) is the wave equation for longitudinal velocity variations. It 

describes the behavior of sound waves with speed, c = yJ(dP/dp)° s , in the ideal 
fluid. Thus, an ideal fluid supports sound waves whose speed is determined by 
the adiabatic compressibility of the fluid. 

In a dissipative fluid, sound modes can be damped. We will now determine 
the dispersion relations for normal modes of the dissipative fluid. As a first 
step, let us Fourier transform the space dependence of the linearized 
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hydrodynamic equaions. We let Ap(r,/) = (1/27 t) 3 j dkp k (t)e~ lkr so that 
^(t) = J dr Ap(r, t)e ,kr . Similarly, v(r,f) = (1/27 t) 3 Jdkv k (f)e~' kr and 
j(r,t) = (1/27t) 3 Jdk7 k (r)e"' kr . If we substitute these expressions into Eq. 
(10.41)-(10.43), we find 

^ - ip„k v„ = 0, (10.47) 


dvk 
p0 dt 


and 



o 

kpk + i 
T 



0 

kr k - rjk 2 v k - (C + H k ( k • v k ), 

p 


(10.48) 


Po 


/ ds\ °<9pk 
\dp/ r dt 


+ Po 


dTj~dr 


K 

To 


k 2 T k , 


(10.49) 


where (\ = pk(r), 7k = r k (f), and v k = v k (f). 

We can simplify Eqs. (10.47)— ( 10.49) still further. Let us divide the velocity 
into longitudinal (parallel to k) and transverse (perpendicular to k) components: 


Vk(0 = vj|(f)k + v£(f), (10.50) 

where k-v^(r)=0 and k • v k (t) = kvj[(f). The component kvj[(r) is the 
amplitude of a longitudinal velocity variation which has wavelength A = 2n/k. 
These are velocity variations in the direction k. The velocity vector, (t), is a 
two-component vector which describes velocity variations with wavelength 
A = 2n/k, but transverse (perpendicular) to the direction of k. 

The transverse and longitudinal components of the average velocity decouple 
in the linearized hydrodynamic equations. They evolve according to different 
equations of motion. From Eqs. (10.47M 10.49) and (10.50), we can write 


dpk 

dt 


- ipokv'l - 0, 


(10.51) 


dvl ( dP\° fdP \ °_ _ , I, 

Po ¥ = ' i w r A + lt (ar) /) T ‘“^ + ^ ^ ’ 

(ds\°dp k fds\°dT k K~ 

Po \dp) T dt +p0 \dTj dt ~ T 0 k k ’ 


(10.52) 

(10.53) 


and 


Po 


dv k L 


- rjk 2 \ k ± . 


dt 


(10.54) 
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Equations (10.51)— (10.53) govern the evolution of the coupled heat and sound 
modes (sound consists of longitudinal oscillations of the fluid). Equation 
(10.54) governs the evolution of shear modes (transverse velocity oscillations). 

We want to solve the hydrodynamic equations, given some set of initial 
conditions / 9 k( 0 ), 7k (0), and Vk(0). It is useful to introduce the Laplace 
transform 


•00 If 

Pk(z)~ dtpk(t)e~ zt and pk(t) = — dzp k {z)e z \ (10.55) 

J0 Z7TIJ g_ ioo 


where 6 is a positive number. The Laplace transform of the time derivative is 


0Pk(O _-zt 

o dt 


d t --~i- e zt = -pk(0) + zpk(z). (10.56) 


Similar Laplace transforms can be written for the other variables. 
Let us now Laplace transform Eqs. (10.51)-(10.54). We find 


zh(z) - IA>*vil (z) = Pk(0), 


(10.57) 


Pozv{(z) - iki-^Uz) - ik (^j T k (z) + (C + %n)k 2 vl (z) = povj[( 0 ), 

(10.58) 

W (l)r* (z) + W 0/ k(z) + % klUz) 

= Po {S) T Pk{0)+P °(^} p Tk{0) ’ (1 °' 59) 

and 

Pozy k {z) + rjk 1 vj[ (z) = poVk (0) . (10.60) 

Let us now simplify these equations. First we note the following 
thermodynamic identities 


fdP\ _ £o 

\dp) T 7 


and 


dP\ __ ppclap _ p 0 cp - c p 
dTj p 7 T 0 exp 


(10.61) 


where c p = To(ds/dT) p and c P = To(ds/dT) p are the specific heats at constant 
density and pressure, respectively, c 0 = VWIW, is the speed of sound, 
a P = —(l/po)(dp/dT) p is the thermal expansivity, and 7 = c P /c p . Note the 
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Maxwell relation, ( ds/dp) T = ~{\/ pl){dP/dT) p . Also, let us now introduce 
the longitudinal kinetic viscosity, i/[, the transverse kinetic viscosity, v t , and the 
thermal dijfusivity, x, which are defined 

vi = — (c + rA = — , and X = (10.62) 

po\ 3 ) po Pqc p 

respectively, Then Eqs. (10.57)-( 10.60) take the form 

zfa(z) - ipokvliz) =pk(0), (10.63) 

(z + vik 2 )v\.(z) - ik — pt(z) - ik^^T k (z) = vj[(0), (10.64) 
7Po 7 

-z^h(z)+%(z + TX*)ik(z) = -^Pk(0) + ^r k (0), (10.65) 

po7 T o Pol to 

and 

(z + v t k 2 )\£(z) = Vj[-(0). (10.66) 

Solving the linearized hydrodynamic equations is now simply a matter of 
doing some algebra. Since die transverse modes completely decouple from the 
longitudinal modes, we will consider them separately below. 

10.C.2. Transverse Hydrodynamic Modes 

Let us assume that the Fourier components of the transverse velocity at time 
t — 0 are known to be (0). Then from Eq. (10.66) we have 

(ia67) 

The Fourier components of the transverse velocity at time t are then 

VjJ-(l) = ~ f <+l0 ° & v k( Q K = y± (0)e -«* (10.68) 

2jr!js-ioc z + i'.k 2 kw 

In order to obtain the final result in Eq. (10.68), we have changed the integration 
along the line z = 6 + iy (with y varying from — oo to +oo) to a contour 
integration with the contour shown in Fig. 10.1. We see that any transverse 
velocity variations in the fluid decay in time and cannot propagate. The short- 
wavelength disturbances decay faster than the long-wavelength disturbances. 
This wavelength dependence of the decay time is the signature of a hydro- 
dynamic mode. 
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Fig. 10.1. The contour, C, used to integrate 
Eq. (10.68). 


10.C.3. Longitudinal Hydrodynamic Modes 

The time evolution of the longitudinal modes can be determined from Eqs. 
(10.63)-(10.65). Let us first write them in the following matrix from: 


( 


\-z 


z 

ikc l 

IPo 

CqQ/> 

Pol 


-ipok 
[z + vik 2 ] 

0 


0 

ikc^acp 


\ 


[z + ixk 2 ] ) 


( Pk(z) \ 
Vk(z) 


( 1 0 0\/Apk(0)\ 

0 10 V 1I(0) . 

\-^f 0 n) \Ar k( o)/ 


(10.69) 


If we multiply by the inverse of the matrix on the left in Eq. (10.69), we obtain 


h(z)\ ( Apk(0) \ 

vj(z) = , v k(°) 

Tk(z) / D(k ’ z) V - A ^°) + n A 7 U°) 


(10.70) 


where D(k, z) is the determinant of the 3 x 3 matrix on the left in Eq. (10.69), 
and M/D(k,z) is its inverse. The determinant D(k,z), can be written 

D( k, z) = [z 3 + £{yi + 7%)^ 2 + *(7X^ 4 + elk 2 ) + c 2 0 xk% (10.71 ) 
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The matrix, M, which appears in Eq. (10.70) can be written 

^(z + ^ I 2 )(z + 7x^) + -pootpclk 2 /^ > 

M= ^f(z + X* 2 ) C fA z + 7X ^ 2 ) ikza P cll 7 

V ^z(z + ^ 2 ) z 2 +zvik 2 + k 2C -%) 

(10.72) 

Before we use Eqs.(10.70) and (10.72) to solve the equations of motion, it is 
useful to look more closely at the determinant, Eq. (10.71). 

To second order in k, the three roots of the equation £>(k, z) = 0, are given by 
Zl = -\k 2 and z± = ±ic 0 k - \k 2 [vi + \{l — 1)]- Therefore, for long- wave- 
length disturbances (small k ), D(k, z) is approximately given by 

D( k, z) « (z + X& 2 ) (z + ikc Q -I- Tk 2 ) (z - ikc 0 + Tfc 2 ) , (10.73) 

Jo 

where r = \ [vi + x(l — 1)] and we have neglected terms of higher order in k. 
This approximate expression for the roots of the equation D(k, z) = 0 must be 
used with care. There are cases when the exact expressions for the roots must be 
kept. 

The dissipative fluid has three longitudinal normal modes. The time 
evolution of the density, longitudinal velocity, and temperature variations each 
contain contributions from all three longitudinal normal modes. One normal 
mode is primarily a heat mode and has a decay rate given by z\. The other two 
normal modes are primarily damped sound modes. The sound modes can 
propagate but eventually will be damped out by dissipative processes in the 
fluid, both viscous and thermal. 

The time evolution of the density, longitudinal velocity, and temperature 
variations can be obtained by performing the following integration: 

( Pk(t)\ 1 />£+ioo ( Pk(z)\ 

U« =5- t{« ■ ( 10 - 74 ) 

\mj V T k (z) j 

In Exercise 10.3 we obtain the time dependence of Fourier components of the 
density under special conditions. 


I EXERCISE 10.3. Compute p^{t) assuming that at time t — 0 we have 

Pk(0) 0, v|[(0) = 0, and 7k (0) = 0. Write the amplitude of the evolution to 

lowest order in k. 
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Answer: From Section 10.C.3 we can write 

[z + vik 2 ][z + ixk 1 ] + c 2 0 k 2 { 7 -l)h 


Pk(z) = 


Jz + vik 1 ] [z 4- ic 0 k + Tk 2 }[z - ic 0 k + Tk 2 } 
The density at time t is 


Pk(0). (1) 


i [6+ioc 

Pk(t) = 7T-. dz{*(z)e zt . 

J 6—ioo 


(2) 


If we retain terms in the amplitudes to zeroth order in k, Eq. (2) reduces to 


Pk(t) 


1 — — xk2 ‘ + —e rk2 'cos(cokt ) 
7/ 7 


Pk(0). 


( 3 ) 


Thus, the initial disturbance in the density is eventually damped out. The 
long- wavelength components are the last to go. 


10.D. DYNAMIC EQUILIBRIUM FLUCTUATIONS AND 
TRANSPORT PROCESSES 

Because matter is discrete, systems in equilibrium will undergo fluctuations 
about their equilibrium states. The hydrodynamic equations govern the 
dynamics of the long-wavelength and low-frequency components of those 
fluctuations, and they determine the behavior of dynamic equilibrium 
correlation functions and dynamic structure factors. Onsager was able to use 
this fact, along with the reversibility of dynamical laws on the microscopic 
level, to show that certain relations exist between transport coefficients describ- 
ing coupled transport processes. Hydrodynamic equations (equations which 
describe the long-wavelength, low-frequency dynamics) can be obtained for a 
large variety of systems, including dilute gases, liquids, solids, liquid crystals, 
superfluids, and chemically reacting systems. For complicated systems, 
transport processes are often coupled. For example, in a multicomponent 
system, it is possible to have a temperature gradient drive a particle current and 
a concentration gradient drive a heat current. Some complicated systems can 
have as many as 10 or 20 (or more) transport coefficients to describe the decay 
to equilibrium from the hydrodynamic regime. Onsagar’s relations are of 
immense importance because they enable us to link seemingly independent 
transport processes and thereby reduce the number of experiments that must be 
performed in order to measure all the transport coefficients. 

Once an expression has been obtained for the dynamic correlation function 
of an equillibrium system, we can use the Weiner-Khintchine theorem to derive 
the dynamic structure function which gives the power spectrum of the 
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equilibrium fluctuations. The power spectrum is the quantity which is most 
often measured in experiments. 


10.D.1. Onsager’s Relations [6, 7] 

Systems which are out of equilibrium generally return to the equilibrium state 
through a variety of transport processes which may or may not be coupled to 
one another. Onsagar, without any reference to a particular physical system, was 
able to derive relations between the transport coefficients for coupled transport 
processes. These relations are a general consequence of the invariance under 
time reversal of Hamiltonian dynamic. There are essentially two aspects of the 
proof: (a) It uses the time-reversal invariance of mechanical equations of 
motion at the microscopic level (Newtons’s equations) to determine properties 
of time-dependent correlation functions, and (b) it assumes that fluctuations 
about the equilibrium state decay, on the average, according to the same laws 
that govern the decay of macroscopic deviations from equilibrium — that is, the 
hydrodynamic equations. 


10.D.1.1. Microscopic Reversibility 

The first step in deriving Onsager’s relations is to show that the time-reversal in 
variance of Hamiltonian dynamics requires that the time-dependent correlation 
functions of macroscopic fluctuations, a = (ai, 0 : 2 , . . . , a n ), obey the relations 

(aiajir)) = (^(r)^). (10.75) 

The quantities a,- = A, — A® and a ; = Aj — A J are the fluctuations about the 
equilibrium values of the state variables A, and Aj (cf. Section 7.C). Equation 
(10.75) tells us that the correlation between a fluctuation a, at time t = 0 and a 
fluctuation a y at time t = r is the same as that of a fluctuation aj at time t = 0 
and a fluctuation a, at time t = r. The quantities a, and can correspond to 
fluctuations in the same state variables at different points in space. Thus, Eq. 
(10.75) can also be turned into an equation relating correlations between space- 
and time-dependent fluctuations. 

To establish Eq. (10.75), we note that the correlation matrix (aa(r)) can be 
written 


(aa(r)) = 


dad<x'<x<x'f(aL)P(ai\a', r) 


(10.76) 


where / , (a|a',r) is the conditional probability that the fluctuation has value a' 
at time t = r, given that it had value a at time t = 0 (we are now using the 
notation of Section 5.B). 

For a closed isolated system, /(a) can be written 


/(«) = 


(2 Tvk B ) n 


1/2 


,-g:aa/2 k B 


(10.77) 



554 


HYDRODYNAMIC PROCESSES NEAR EQUILIBRIUM 


[cf. Eq. (7.24)], and the change in entropy which results from these fluctuations 
is 


AS = — - g : oca. 
2 6 


(10.78) 


It is useful to introduce a generalized force, %, which is defined as 

(dA S\ 

x = ga = -(^j' 

and a generalized current, % defined as 

„ da 

* = Tt ' 

Then the time rate of change of the entropy due to fluctuations is 

dAS 


dt 




(10.79) 


(10.80) 


(10.81) 


For a resistor held at constant temperature, where £ is the electric current and / 
is the applied electric field, Eq. (10.81) is proportional to the rate at which 
energy is dissipated through Joule heating. 

We must now remember that a is a macroscopic variable. Thus, for each 
value of a there are many possible microscopic states of the system. We can 
relate the joint probability distribution/(a)P(a|a / , r) for fluctuations a and a' to 
the microscopic probability density in the density in the following way: 




1 


Vae(E) 


(a->a+da) 

(E-.E+AE) 


dq N dp N 
d^ N dp fN P(p N ,q N \p fN ,q fN ,r). 


(10.82) 


(a 


In Eq. (10.82) we have used the fact that ^(p^q^) = fW(£) 1 f° r a closed 
isolated system (£Iae(E) is the volume of the energy shell). The phase space 
integrations are restricted to the energy shell and to trajectories with values of a 
and a' appearing in the left-hand side of Eq. (10.82); P( p^, q^lp'^, q' N , r) is the 
conditional probability that a system be in a state (p /A? , q'^) at time t = r, given 
that it was in the state (p^, q^) at time t = 0. Since classical systems are 
completely deterministic, we must have 


P(p N , q*|p w , q ,N , t) = <5(q w - q* - Aq*(p* q* r)) 
x <5(p w - p* - Ap"(p",qV)), 


(10.83) 


where Aq^ and Ap^ are uniquely determined from Hamilton’s equations. 
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Because Hamilton’s equations are causal and time-reversal invariant, reversal 
of all momenta in the system will cause the system to retrace its steps. This 
implies that 


P(q", p"|q' A ', P* r) = P(q' N , -p"V, -p", r). (10.84) 

Since a and cl' are even functions of momentum, we can combine Eqs. (10.82)- 
(10.84) to obtain microscopic detailed balance 

/(a)P(a|a',r) = f ( cL ') P (< x '\ cL , r). (10.85) 

From Eq. (10.85), Eq (10.75) follows easily. 

10. D. 1.2. Regression of Fluctuations 

Equation (10.75) is important because we can use it to find relations between 
various transport coefficients. Let us first introduce the conditional average 
<a(r)) , which is the average value of a at time t , given that the initial value of 
a was oq. We then can write 


<«w> w = 


daaP(ao|a, t ) 


( 10 . 86 ) 


for the conditional average. 

Onsagar assumed that, on the average, the fluctuations decay according to 
the same linear laws (hydrodynamic equations) that govern the decay to 
equilibrium of systems which are macroscopically out of equilibrium. Thus, the 
average fluctuation, (a( t )) ao , obeys an equation of the form 

J t <«(')>«, = -«■ •<««>„• ( 10 - 87 ) 


Equation (10.87) has the solution 


<«('))«. = (10.88) 

The time derivative in Eq. (10.87) must be used with caution. It is defined in the 
following sense: 


<*(«(Q)«0 _ + r ))«o - W)go 

dt T 


(10.89) 


where r is a small time interval whose values are bounded by inequalities. 


Tq^tCT. 


( 10 . 90 ) 



556 


HYDRODYNAMIC PROCESSES NEAR EQUILIBRIUM 


To is the time between collisions, and T is the time it takes the fluctuation to 
decay to equilibrium. The limitation in Eq. (10.90) rules out fluctuations which 
are too small — that is, fluctuations which decay to equilibrium in a few collision 
times. Similarly, Eq. (10.90) is not valid when the fluctuation has just been 
created. It takes a few collision times for it to settle down to a hydrodynamic 
decay. 

Equation (10.75) imposes a condition on the matrix M. If we expand Eq. 
(10.88) for short times, 


( a (0)«o = ao-? M -ao + 0(f 2 ), (10.91) 

and substitute it into Eq. (10.75), we obtain 

{aoM • «o) = {M • aooo)- (10.92) 

If we now use the fact that M • a = a 7 • M 7 and use Eq. (7.27) for the variance 
in the fluctuations, we obtain 

g 1 -M 7 = M-g~ 1 , (10.93) 

where T denotes the transpose. We can define a new tensor 

L = M ■ g -1 ; (10.94) 


then Eq. (10.93) becomes 


L = L 7 or Lij = Lp. (10.95) 

Equations (10.95) are called Onsagar’s relations. If we make use of the 
generalized force / [cf. Eq. (10.79)], the time rate of change of the fluctuation 
can be written 


|w»)>* = -t-(*(0W («■*) 

Equation (10.96) has the same form as the linearized hydrodynamic equations 
(cf. Section 10.C). Eq. (10.95) is so useful that Onsagar received a Nobel prize 
for deriving it. The matrix L is a matrix of transport coefficients. Equation 
(10.96) tells us that a force resulting from a fluctuation a, can cause a flux of Ay, 
and a force arising from a fluctuation aj can cause a flux of A, . Eq. (10.95) tells 
us that the transport coefficients for the two processes are the same. For 
example, a particle concentration gradient can derive a heat current, and a 
temperature gradient can drive a particle current. The transport coefficients for 
the two processes are the same although the processes physically appear to be 
very different. 
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10.D.2. Weiner-Khintchine Theorem [3, 8, 9] 

The Wiener-Khintchine theorem enables us to obtain a relation between the 
correlation matrix for time-dependent fluctuations and the spectral density 
matrix of fluctuations for ergodic systems with stationary distribution functions. 
We shall first derive some properties of time-dependent correlation matrices and 
then derive the Wiener-Khintchine theorem. 

Let us consider the time-dependent correlation matrix, (a(r)a(O)), for a 
system governed by a stationary distribution function (such as a system in 
equilibrium). The correlation matrix has the property 

C aa (r) = (ot(r)a) = (a (t + r)a(t)) = (a(-r)a) 7 ’ = C aa (-r) r , (10.97) 


where we have let t = — r and T denotes the transpose of the correlation matrix. 
From the condition of microscopic reversibility, we know that 
(a(r)a) = (aa(r)) and, therefore, 

C.«(r) = C„(t) t (10.98) 

Furthermore, from Section 7.C we have 

C«.(0) = (««>=***-' (10.99) 

where g 1 depends on the thermodynamic response functions. From Eqs. 
(10.88) and (10.97) the correlation matrix can be written 


C aa (r) = J ^a 0 /(cxo)ao{a(r)) a[0 = k B g 1 - e M|r| (10.100) 


since M is a self-adjount matrix (|r| indicates the absolute value of r). 

We now will introduce the spectral density matrix of fluctuations and show 
that it is the Fourier transform of the correlation matrix. Let us introduce a 
slight modification of the state variable ot(f) as follows: 


„ (t . t) = I a W M < 

^ ^ “ { 0 \t\>T 


such that 


lim a (t; T ) = a(t). 

T-»x v ' w 

We next introduce the Fourier transform of a(f; T): 

•oo f 'T 

a(<j; T) = dta(t ; T)e iwt = dUx(t; T)e iwt . 

J — oo J —T 


( 10 . 101 ) 


( 10 . 102 ) 


(10.103) 



558 


HYDRODYNAMIC PROCESSES NEAR EQUILIBRIUM 


Since the fluctuations, a(r), are real, we find 

T ) = a(-w; T) 

(* denotes complex conjugate). 

The spectral density matrix is defined as 

= lim ;; T)a(ur, T) 

T—+oc l 


(10.104) 


(10.105) 


(cf. Section 5.E.2). Combining Eqs. (10.102) and (10.104) we can write 




dre iuT lim i 
T—*oc T 


dta.(t; T)a(t + r; T). 


(10.106) 


If we now invoke the ergodic theorem (cf. Section 6.C), we can equate the time 
average in Eq. (10.106) to the phase average of the fluctuations: 


( aa ( r )> = Km - 


1 f 


= Am — 

T—*oo T 


-T 
j roc 


a(r)a(r + r)dt 
a (/; T)a(t + r; T)dt. 
Then Eqs. (10.106) and (10.107) lead to the relation 


(10.107) 


S««(u;) — 


dre lun (aa(r)) = | 


dre^C^ir). 


(10.108) 


Thus, the spectral density matrix is the Fourier transform of the correlation 
matrix. Equation (10.108) is called the Wiener-Khintchine theorem. 

We can now derive some general properties for the spectral density matrix. 
From Eq. (10.97) we find that the spectral density matrix is Hermitian, 


S«a(tu) — 


dre^C^-T) 7 = 


(10.109) 


where we have let r — > -r to obtain the last term. Furthermore, since C aa (t) is 
real, we have 


s;w 


•OO 

rfre-^C„(-r) = S„(-w), 

J — OO 


(10.110) 


It is useful to divide S aa (u;) into its real and imaginary parts, R«o(^ ; ) and I aa (u;), 
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respectively: 

S „(w) = R „(w) + flU(o;). (10.111) 

Then from Eqs. (10.109) and (10.110) we find that R„(u;) is a real, symmetric 
matrix and an even functin of u (R^u;) = R aa (a;) :r = R aa (— a;)), while I aa (u;) 
is a real, antisymmetric matrix and an odd function of uj (I« a (^) = 
— Iaa^) 7 = — I aa (— cu)). Furthermore, we find that S aa (a;) = S aa (a;) . Thus, 
^(a;) = — Iaa(^) = 0, and the spectral density matrix is a real, symmetric 
matrix and an even function of uj. We therefore can write the correlation matrix 
in the form 

1 00 1 00 

Coa(r)=— duje~ luJT S aa (-uj) = - rfa;R aQ[ (a;)cos(a;r). (10.112) 

, —go 7T 0 

If a magnetic field is present or the system is rotating, Eq. (10.112) must be 
generalized to take into account the behavior of the magnetic field or angular 
velocity under time reversal [3]. This is done in Section S10.A. In Exercise 
10.4, we compute the power spectrum of density fluctuations in an isotropic 


■ EXERCISE 10.4. The dynamic density correlation function for a fluid in 
a box of volume V may be written 

1 

C nn (p, r) = - dr(n{r + p, r)n { r , 0) , 

where N is the total number of particles. The dynamic structure factor is 
given by 

• OO 

S m ( k,n)= dp drC nn (p, r )e ikfi e ~‘°' T . 

— OO 

Compute both C„„(p, r) and S„„(k,Q) for a fluid of point particles in 
equilibrium. For such a fluid, the average, (), is taken over the equilibrium 
distribution of fluctuations. Assume that fluctuations in the average velocity 
are independent of temperature and density fluctuations. 

Answer: 

(a) Let us first write the density variations as n(r,t) = «o + An(r, t), 
where no = (n(r,t)) = N/V, is the equilibrium density and An(r, t) 
describes fluctuations about equilibrium such that (An(r, t)) = 0 
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If we substitute into the expression for the correlation function, we 
obtain 


C nn { P, r) = ipr(n(r + p, t)n{r, °)}> 

= n ° + j dr ( An ( r + p ’ 0 An ( r > °)) • 

Let us introduce the Fourier series expansion of the density 
An(r,t) =^-^2e~ lkr n k (t) with n k (t) = 


(1) 


dre lkr An(r, t), (2) 


and of the correlation function 


C„„(k, r) = f V kp C m ( p, r) = OT k ,o + ^ (« k (/)«- k (0)). (3) 


To obtain the last term in Eq. (3), we have substituted the Fourier 
expansion of the density, Eq. (2), into Eq. (1) and have used the 
identity 


1 

V 


dre‘ kr = < 5 k0 , 


( 4 ) 


where ^k,o is the Kronecker delta function. From Einstein fluctuation 
theory (Section 7.C) we know that temperature and density fluctua- 
tions are statistically independent. Therefore, (An k (0)AT- k (0)} = 0. 
Also, we will assume that velocity fluctuations are statistically 
independent of density fluctuations. Therefore, any contributions to 
the density at time t, due to fluctuations in velocity or temperature at 
time t = 0, will disappear when the average, {), is taken. As a result, 
we can use the solution for the density, = mn k (t), obtained in 
Exercise 10.3. to determine the time evolution of the density 
correlation function. We can write 


C MM (k, t ) = N6 k o + (nk(O)n-k(O)) 


x j^l--^e xk2 ^+^e ™ 2 |f| cos (c 0 kt) , 


( 5 ) 


where 7 = cp/c p , cq is the speed of sound, T = \ [vi + x(l ~ 1 )]* W is 
the longitudinal kinetic viscosity, and x is the thermal diffusivity. The 
absolute value of the time appears in Eq. (5) because it is the 
correlation function for a stationary process. We must remember that 
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Eq. (5) is an approximate expression for the density correlation 
function. Some terms proportional to the wave vector have been 
neglected. We shall leave it as a homework problem to determine the 
size of these neglected terms. 

(b) Let us now consider the spectral density function (also called the 
dynamic structure factor), 


Snn(K O) — 


dp 


dtC„ n ( p, 


( 6 ) 


If we substitute Eq. (1) into Eq. (6) and made use of Eqs. (3) and (5), 
we find 


S nn ( k, O) = 2nN6(fl)6 kfi + (n k (0)n_ k (0)) 


2 xk 2 


7/ + X 2 * 4 


1 

+ - 


r k 2 


+ 


Vk 2 


i Wo - c 0 ky + r 2 * 4 (o + c 0 ky + r 2 * 4 


(7) 


where we have used the fact that ( 1 /2n) dte in ' = 6(0). Away 

from the critical point, the correlation function, (n k (0)n_ k (0)), is 
independent of k and proportional to the compressibility. Near the 
critical point it behaves like (C + k 2 )~ l , where C ~ (T - T c ) [cf. Eq. 
(8.21)]. The dynamic structure factor for an isotropic fluid has three 
terms of the form 


L 


m) = 


2A 

A 2 + (O - O') 2 ’ 


( 8 ) 


where /(O) is a Lorentzian centered at frequency, O', with half- width 
at one-half maximum given by A. Therefore, the dynamic structure 
factor, as a function of frequency, has three peaks. One is centered at 
0 = 0 with width xk 2 and is due to the thermal hydrodynamic mode. 
This is called the Rayleigh peak. The other two are centered at 
O = ±c 0 k with width, Tk 2 . They are called the Brillouin peaks. 


10.E LINEAR RESPONSE THEORY AND THE 
FLUCTUATION-DISSIPATION THEOREM [3] 

Fluctuations in an equilibrium system decay, on the average, according to the 
same linear macroscopic laws (in a fluid system the hydrodynamic equations) 
that describe the decay of the system from a nonequilibrium state to the 
equilibrium state. If we can probe the equilibrium fluctuations, we have a means 
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of probing the transport processes in the system. The fluctuation-dissipation 
theorem shows that it is possible to probe the equilibrium fluctuations by 
applying a weak external field which couples to particles in the medium but yet 
is too weak to affect the medium. The system will respond to the field and 
absorb energy from the field in a manner which depends entirely on the 
spectrum of the equilibrium fluctuations. According to the fluctuation- 
dissipation theorem, the spectrum of the equilibrium fluctuations and the rate 
of absorption of energy from the external field can be expressed in terms of a 
response matrix which is related to the correlation matrix for equilibrium 
fluctuations. 

In this section we derive the fluctuation-dissipation theorem. We first 
introduce linear response theory and use the assumption of causality to obtain a 
relation between the real and imaginary parts of the dynamic susceptibility 
matrix, which is the Fourier transform of the response matrix. We then obtain a 
relation between the dynamic susceptibility matrix and the correlation matrix, 
and we obtain an expression for power absorption in terms of the dynamic 
susceptibility matrix. This gives us a relation between the fluctuations in an 
equilibrium system and energy absorbed by that system when an external field 
is applied. 


10.E.1. The Response Matrix 

Let us assume that external forces, F = {F \ , F 2 , . . . , F n ), are applied to a system 
and that these forces couple to the state variables, A\,A 2 , . . . , A„, causing them 
to deviate from their equilibrium values. We shall assume that the deviations 
from equilibrium of the state variables, Ai,A 2 , . . . ,A„, depend linearly on the 
applied forces (linear response). Then we can write 


(*W)f 


•OO TOO 

dt'K(t - t') ■ F (0 = K(r) • F (t - r)dr. 

J— 00 J— OO 


(10.113) 


The matrix K(f — f 7 ) is real and is called the response matrix. Since the 
response must be causal (the response cannot precede the force which causes 
it), K(f — t') must satisfy the causality condition, 

K(r-r') = 0, r — r' < 0, (10.114) 

Since Eq. (10.1 13) is linear in the force, its Fourier transform has a very simple 
form. If we note that 


1 r 00 

(*(0)f = 27r J («M ) ¥ e~ ,ut duj (10.115) 

and use similar expressions relating F(r) to F(u>) and K(r) to we obtain 

(a(u;)) F = xM • ®V), (10.116) 
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where 


J OO 

K(l)e"'dt (10.117) 

— CX) 

is the dynamic susceptibility. We have also used the definition for the delta 
function, 6(t) = ( 1 / 27 t) J_°^ dcue~ luJt . 

Thus, a force of a given frequency can only excite a response of the same 
frequency. This will not be true if the response function depends on the force 
(nonlinear response). 


10.E.2. Causality 

Let us now study the restrictions imposed on the response matrix by causality. 
We shall assume that the response matrix relaxes fast enough that the integral 

*oo 

K(r) dt < oc (10.118) 

. o 

is finite. Physically, this means that a finite force must give rise to a finite 
response. It is convenient to continue the frequency dependence of x(u;) into the 
complex plane by writing 


I oo 

dtK(t)e iz ‘ , (10.119) 

o 

where z = u> + ie (e positive). If x(w) is well-behaved (nonsingular), then x(z) 
will be well-behaved since the extra factor e~ £t introduced simply makes the 
integral fall off faster. The function x(z) is not singular (has no poles) for z in 
the upper-half complex plane, but we can say nothing about its behavior in the 
lower-half complex plane. Note also that x(z) — > 0 as e — ► oo. 

The matrix x(^) is complex (x*(u>) = x( — ^))» hut causality enables us to 
obtain a relation between its real and imaginary parts. We do this by 
introducting the following trick. Define a new matrix 

f ( z ) = lfeL (10.120) 

z — u 

and integrate it over a contour C' (cf. Fig. 10.2) so that C' encloses no poles of 
f(z). (Note that u is real.) Then 

o f(z)<fe = 0 = £ ,^~ u dz - (10.121) 

Since x(z) — > 0 as £ — > oo there will be no contribution from the semicircle at 
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Im(z ) 



Fig. 10.2. Integration contour used to obtain the Kramers-Kronig relations. 


infinity. Thus, 


X(z) 


c z — u 


dz = 


xM 


. -no U-U 

+ 


duj -I - 


du 


xH 


u+r 


UJ — u 


° U" + T*) _ o. 

- u + re't-u 


It is useful to introduce the Cauchy principal part, 


du 


\r du w + r du 

-u r-OLJ-oo UJ — U ) u+r 


UJ 


xM 


UJ — u 


Equation (10.122) then gives 


( 10 . 122 ) 


(10.123) 


duo = — lim [ id(f>x (u + r e l<t) ) = inx(u). (10.124) 

UJ — U r —>0 


or 


xM 


± p f°° X(“)duJ 

7 Ti J-oc UJ U 


(10.125) 


Equation (10.125) is a consequence of causality and allows us to relate the real 
part, x'M, and the imaginary part, x!'{uj), of the dynamic susceptibility matrix. 
Let us write 


xM = x'M + *x"M (10.126) 


and make us of Eq. (10.125). We then obtain the following relations between 
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%'W and x"H: 


and 


1 00 y ,f ( /A 

X\u)=-P du (10.127) 

7T J_ oo LO-U 

f(u) = -- P f du ;. (10.128) 

7T J-00 U-U 


Equations (10.127) and (10.128) are called the Kramers-Kronig relations and 
enable us to compute the real part of x(u) if we know the imaginary part and 
vice versa. As we shall see, the imaginary part of x(oj) can be obtained from 
experiment. 

The Kramers-Kronig relations give us a relation between the real and 
imaginary parts of the dynamic susceptibility matrix. Therefore, it is only 
necessary to find one or the other to determine the response, («(?)). We can 
show this explicitly as follows. The Kramers-Kronig relations allow us to 
rewrite the dynamic susceptibility matrix, x(u;), as 


XM = lim 

rj — >0 


J 


du' x'V) 

7 r u' — u — ir/ ’ 


(10.129) 


where we have used the identity 


1 

lim — = P 

t)—*o u' — uip ir] 



± iTt6{u' — a;). 


(10.130) 


The response (a (t)) then becomes 
1 


(*( 0 ) = 


2n 


due-^xiu)- ¥{u) 


r oc 


= lim 


v->o (2 ; T y 


r oc r o 

dt' j da; j 


00 -iu>(f-f') 

du'- -XV)-F(0- 

00 u '- U - lT ) 


(10.131) 


In Eq. 10.131, we have used Eq. (10.129) and have Fourier transformed F(o;). If 
we now make a change of variables u" — u — u' and introduce the following 
spectral representation of the Heaviside function, 


Sit -t') = - lim 

V ' T] — >0 


■°° du g-M*-Q 

—00 ( 27 rz) -(- ir) 


(10.132) 
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we obtain 


<«M) = 2i 


dt' K" (t - 0 • F(r'), 

J— 00 


(10.133) 


where K"(r - /') is the Fourier transform of K"(o;). Thus, we have succeeded in 
writing the response entirely in terms of the imaginary part of the dynamic 
susceptibility matrix. The expression for the linear response given in Eq. 
(10.133) is the one most commonly seen in the literature. 

Everything we have done to this point is completely general. Let us now 
obtain an explicit expression for the response for the case of a constant force 
which acts for an infinite length of time and then is abruptly shut off. The force 
we consider has the form 


F(/) = 


{: 


for t < 0 
for t > 0. 


(10.134) 


The Fourier transform of the force is 


t{u) = F 


rO 


e Mt dt = 


= lim F f° 

J_ c 


e lzl dt 


= lim F - = (-/) lim F . , 0 0 _ 

o iz e—o la ; 2 -he 2 up- -he 2 


u le 

+ 


(10.135) 


If we now use the following definitions for the Cauchy principal part 


P I — I = lim 


uj 


lj J £— >o up + e 2 


and the Dirac delta function 


(10.136) 


6(uj) = lim 


we obtain 


F(u) = F 


£—►0 ur + e 2 


F( T- ) + 7t6(uj) 
1 IU) ' 


(10.137) 


(10.138) 


From Eqs. (10.113) and (10.116) the response can be written in the form 

j r oo 


(*W>F = 


2n 


due x{u) • F(o;). 


(10.139) 
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It is useful to write the response in a slightly different form. To do this we 
must generalize Eq. (10.125). Let us introduce the matrices 

f + (z,t) = ^^, t > 0 

z - u 

(10.140) 

and 


f-fe,)-'"**® t < 0 

z - u 

(10.141) 

With our choice of sign in the exponential factors, the functions f±(z, t ) will 
still go to zero as e — ► oc. From Eq. (10.125) we obtain 

iut _ ( . 1 „ f 00 e lwt l{u)du 

<? X(«) = — P — — , t > 0 

iTT J-oc W-M 

(10.142) 

and 


iut _, . 1 „ f 00 e~ iut x(uj)du) 

e X{u) = — P — , t < 0 

J_ oc UJ-U 

(10.143) 

Then, for t < 0, Eq. (10.143) yields the expression 


iTT J-oo u; 

(10.144) 

and, for t > 0, Eq. (10.142) yields the expression 


J-oo o; 

(10.145) 

If we combine Eqs. (10.138), (10.139), and (10.145), the response for t < 0 
is 

(<x(r)) F = X(0)-F 

(10.146) 

and for t > 0 


1 f 00 iiuS) • F 

= duJ cos (ut). 

ITT J_oo U J 

(10.147) 

Thus, while the force is turned on, the response is constant. When it is turned 
°ff, the response becomes time dependent. The variables A\,...,A n begin to 
decay back to their equilibrium values. 
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10.E.3. The Fluctuation-Dissipation Theorem 

The fluctuation-dissipation theorem is extremely important because it relates 
the response matrix to the correlation matrix for equilibrium fluctuations. As a 
result, the external field can be used as a probe of equilibrium fluctuations. 

To derive the fluctuation-dissipation theorem, let us consider a system to 
which a constant force is applied from t = — oo to t = 0 and switched off at 
t — 0. We first write the response, (<x(r)) F , for times t>0 in terms of the 
conditional average (ot(r)) , 

(a(?)) F = | rfa 0 /(ao, F )(a(r)) ao forr>0, (10.148) 

where /(ao, F) is the distribution for fluctuations ao at time t = 0 in the 
presence of a constant external field, F. For times t > 0, the field is no longer 
turned on and we can write 

<«('))„= «- S# ' • «o for t>0 (10.149) 


[cf. Eq. (10.88)1. Combining Eqs. (10.148) and (10.149), we obtain 

(a (0 )f = e ~ M ‘ • j ^«o/(«o, F)«b = e~ Mt • («(0)) F 

= e~ Ut ■ x(0) • F = — P f *iu;cos (a it) ^ • F, 

J-oo ^ 

where we have used Eqs. (10.146) and (10.147). Thus, we find 

e~™‘ ■ x(0) = -r- P [ ducos (cut) . 

J —00 ^ 

If we remember that 


(10.150) 


(10.151) 


C„(t) = («(/)«) = k B e -*"‘ I • g-‘, (10.152) 


[cf. Eq. (10.100)] we may combine Eqs. (10.150) and (10.151) to obtain 


C Ur) = Tff du cos M ^ ■ r‘(0) • I" 1 (10.153) 

, — oo ^ 


for t > 0. Thus, we have obtained a relation between the dynamic susceptibility 
matrix, /(cj), and the equilibrium correlation function for fluctuations. In 
Exercise 10.5 we show that x(0) = g ~ l /T. Therefore, we finally obtain 


C«(0 


k B T 

i'K 


du cos (ufy 


UJ 


(10.154) 
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Equation (10.154) is the famous fluctuation-dissipation theorem. It gives a 
relation between the linear response function and the correlation function for 
equilibrium fluctuations. 


| ■ EXERCISE 10.5. Prove that x(0) = g 1 /T, where g is the matrix whose 
; matrix element is gy = (d 2 S/da i daj) u . 

Answer: The external field, F, does work on the system and increases its 
internal energy by an amount 

! dU = Fdx. (1) 


We can expand the differential of the entropy dS and use internal energy and 
state variables a as independent variables, 


dS = 



dU + 



(2) 




But ( dS/dU) x — l/T and {dS/da.) v = — g • a. Therefore, we can rewrite dS 
as 


dS = - g • dot (3) 

1 For a constant force, (at) = x(0) • F is the expectation value of (a) rather 
1 than zero, and the entropy will have its maximum for a = x(0) • F. Thus, 
from Eq. (3) we have 



and the condition that entropy be maximum at a = x(0) • F yields 



«=X(0)F 


= j F — g • x(0) • F = 0 


or 


(4) 

(5) 




X(0) = '• 


L 


( 6 ) 
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10.E.4. Power Absorption 

The work done by an external force F to change a by an amount dtx, is 

flW=-¥-dz (10.155) 

(this is work done on the medium). The average rate at which work is done on 
the medium is just the power P(t) absorbed by the medium: 

P(<) = (~j _ = -F(t) • <«(»))„ = -F(f) j t |” dt'K(t - t') ■ F(f). 

(10.156) 

If we write the right-hand side in terms of Fourier transforms xi u ) and F(u;), we 
obtain 

( i \ 2 oo p oc 

— ) du dJ • ftd) • F(c j')e- i(u,W)t . (10.157) 

— oo J— OO 

We can now compute the power absorbed and the total energy absorbed for 
various types of external forces. 

10.E.4.1. Delta Function Force 

Let us assume that at time f = 0a delta function force is applied. Then, 

F (t) = F 6(t) and F(o;) = F. (10.158) 

Substituting into Eq. (10.157), we obtain 

( i \ 2 /. oo poo 

— J j du) J dJu'x{J) :¥¥e-^ +u/)t . (10.159) 

(Note: F • /(a;) • F = x(a;) : FF.) We can find the total energy absorbed by 
integrating over all times: 

•OO / 1 \ poo 

w,b s = P(t)dt = -( — \ duu!x"(u): FF, (10.160) 

— oo V^/ J - oc 

where %"(uS) is the imaginary part of the dynamic susceptibility matrix. Since 
the total energy absorbed must be a real quantity, only the imaginary part of 
X(u>) contributes. 

10.E.4.2. Oscillating Force 

Now let us consider a monochromatic oscillating force of the form 


F(r) = F cos ujQt = |F (e*** + e-***). 


(10.161) 
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Then 

F(a>) = 7rF(6(a; + u;o) + S(lj — ujq)). (10.162) 

From Eqs. (10.157) and (10.162), the power absorbed can be written 

Pit) = -J [(-**) {e-^ + l)xM + (w o)(^' + l)x(-wtt)] : FF 

(10.163) 


As we can see, the instantaneous power absorption oscillates in time. We can 
find the average power absorbed by taking the time average of Eq. (10.163) over 
one period of oscillation: 


w»=t 


tt/coq 


dtP(t) = ^[xM - *(-*»)] : FF = : FF, 

(10.164) 


where x "(^o) is the imaginary part of x(^o)* Again we see that the average 
power absorbed depends on the imaginary part of the response matrix. In 
principle, the average power absorbed can be measured, and therefore x^O^o) 
can be measured for all lj 0 . The Kramers-Kronig relations allow us to obtain 
the real part of x(^o) once we know X^o)- The fluctuation-dissipation 
theorem relates xM t0 the correlation matrix C««(t) for equilibrium 
fluctuations and therefore also relates x(^) to the spectral density matrix, 
S«(w), of equilibrium fluctuations. Thus, by applying a weak external field to a 
system, we can probe the equilibrium fluctuations. 


j ■ EXERCISE 10 . 6 . A Brownian particle of mass m is attached to a 
I harmonic spring with force constant, k, and is driven by an external force, 
j F(t). The particle is constrained to move in one dimension. The Langevin 
i equation is 


m d _ x {t) + + mwlx(t) = £(t) + F(f), 


dt 2 


dt 


\ where Uq — ( k/m ), 7 is the friction constant, and £( t ) is a Gaussian white 
1 noise with zero mean. Then equation of motion of the average position, 
| (*(0)/o h* the presence of the external force, F(t), is 


d 2 {x{t)) F , d{x(t)) F 
m dfi +7 dt 


+ mu o(x(t)) F = F(t). 


(a) Compute and plot the linear response function, K(t), where 
(x(t)) F = K(t — t')F(t')dt'. (b) Compute the total energy absorbed for 
the case of a driving force, F{t) = F 0 S(t) [10]. 
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Answer: 

(a) From the condition of causality we know that K{t - t') ^ 0 for t > f 
and K(t — f) = 0 for t < t*. Also, the linear response function is 
independent of the type of force we apply. Therefore, we can use the 
simplest force to obtain an expression for the linear response function. 
If we apply a delta-function driving force, F(t ) = F 0 6(t) at time 
t = 0, then for times t > 0 the response is 


(x( t)) F = K(t)F 0 . 

Given that the average, (x(t)) F , obeys the equation 


m 


d 2 (x(t)) F d{x{t)) 


+ 7‘ 


+ muj${x(t)) F = F(t), 


dt 2 ' ' dt 

we obtain the following equation for the response function 
d 2 K(t) dK(t) 


m 


If we take the Fourier transform of Eq. (3), we obtain 
-mu 2 x(uj) - iuryx(u) + mcj 2 X (cj) = 1, 
where x( u ) is the susceptibility function. Therefore, 

1 


xM = 


—moj 2 + mtjQ — i'jtj 


(1) 


( 2 ) 


(3) 


(4) 


(5) 


We can divide the susceptibility function into its real part, and 
imaginary part, x"(u;), so that 


xM = x'M + ix"M- 


Then we find 


x'(^) = 


m(cjQ — u j 2 ) 


and 


x» = 


m 2 (ujQ — uj 2 ) 2 + 7 2 o j 2 


m 2 (uQ - uj 2 ) 2 -(- 7 2 a ; 2 


(6) 


( 7 ) 


(8) 
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The response function is given by 


K(t) = 


2-7T 


dujxHe luJt = 


du 


—mu 1 + muQ — iyu ' 

(9) 


N otice that the poles of x(cj) occur at u = —i(^/2m) ± 
\/u\ — ( 7 2 / 4m 2 ), which is in the lower half-plane, x(^) has no 
poles in the upper half-plane. For t > 0 we can write Eq. (9) as a 
contour integral, but we must close the contour in the lower half-plane 
so that the integral will be convergent. There are two poles in the 
lower half plane, so we get a nonzero result. For K Owe must close 
the contour in the upper half-plane, but since there are no poles there, 
we get zero. Therefore we find 


= We 7?/2m sin [yju 2 0 7 2 / 4 m 2 1 ) 
my/ul — 7 2 /4m 2 

where 6(t) is the Heaviside function ( 6{t ) = 1 for t > 0 and 0{t) = 0 
for t < 0). A plot of the response function is shown below for the (I) 
underdamped case where uo > ^/lm{uQ = 3, ( 7 / 2 m) = \ ,m = 1) 
and for the (II) overdamped case where uo < 'y/2m(u 0 = 2, 
( 7 / 2 m) = 3,m = 1). If we hit the Brownian oscillator with a delta 
function force, it is displaced from equilibrium and slowly decays 
back to its equilibrium value. 




(b) The total energy absorbed by the harmonic oscillator for the case of 
the delta function force can be obtained from Eq. (10.160) and Eq. ( 8 ) 
above. We find 




_ _ 27T J-c 


du 


yu 2 f l 


m 2 (uQ — u 2 ) +7 2 u 2 



(8) 
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Since the absorbed energy is negative, this is actually energy which is 
dissipated by the oscillator due to friction. 


10.F. TRANSPORT PROPERTIES OF MIXTURES 
[3, 11, 12] 

Hydrodynamic processes are a consequence of both (a) symmetries at work in 
fundamental interactions and (b) entropy producing decay processes caused by 
interactions and internal changes in the constituents of a hydrodynamic system. 
In Sections 10.B and 10.C we derived the full set of hydrodynamic equations 
for a single-component classical fluid of point particles. Most of the interesting 
behavior in such a system occurs in the macroscopic flow processes. Indeed, 
three transport coefficients characterized the decay to equilibrium of such a 
system, thermal conductivity, shear viscosity, and bulk viscosity. Both the shear 
and bulk viscosity characterize decay processes in the macroscopic flow of the 
fluid. Only the thermal diffusion would remain if the fluid were at rest. In this 
section we will consider a multicomponent fluid of particles that can interact 
and undergo chemical reactions, but we will assume that, while constituent 
particles may move relative to one another, the fluid as a whole does not move. 
Under these conditions, there will be no viscosity but there will be various kinds 
of diffusion processes at work. In Section 10.F.1 we will derive a general 
expression for the entropy production in such systems, since this is the starting 
of any transport theory. In Sections S10.D and S10.E we will study some 
examples of transport processes in multicomponent fluids and membranes. 

10.F.1. Entropy Production in Multicomponent Systems 

Let us imagine a large system containing N different kinds of molecules under- 
going r different chemical reactions. We will focus on a small sub-volume, V, 
within that system and assume that the small subvolume can exchange energy 
and matter with the larger system it is embedded in. The combined first and 
second laws describe the thermodynamic behavior of the material inside the 
subvolume, V. They can be written TdS = dU + PdV — ^2jLi Pj dm j an( * ^ 
fundamental equation can be written TS — U + PV — Pj m v w h ere m j ls 
the mass of particles of type j. The mass, mj, can be written ntj = Mjtij, where 
Mj is the molecular mass (grams/mole) of particles, j, and ttj is the number of 
moles. It is useful to rewrite the combined first and second laws in terms of 
densities. We will let s = S/V, u = U/V, and pj = ntj/V, where pj is the mass 
density of particles of type j. Then we find 

N 

Tds -du + ^2 h d Pj = 0 (10.165) 

;=i 

for a subsystem with variable volume, V. 
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Let us now return to the first law, dU = dQ — PdV 4- YljLi A ijdntj, which 
keeps track of energy exchange. We can divide the heat transfer, dQ, into a 
contribution from pure heat exchange and a contribution due to transport of 
matter. This is easy to see for reversible processes where we can write 

N 

SQ = TdS = T[dS\ {mi] +tJ2 ¥*"/■ (10166) 

7=1 

where sj — (dS/drrij ) T is a partial specific entropy. If we further note that 
sjT = hj- flj, where hj — (dH/dm.j) T p^ mj ^ is a partial specific enthalpy and 
[Lj is a partial specific Gibbs free energy (the chemical potential of species, j), 
then Eq. (10.166) takes the form 


N 

SQ = TdS = T[dS\ {mi) + Y, (k - (10.167) 

7=1 

The first term, is the change in the heat content of the system due to 

mechanical and thermal heat exchange with the outside environment. It does 
not include the change in heat content due to flow of particles into or out of the 
system. For a system out of equilibrium, where dQ < TdS, we can write 
dQ = dQ' + Ylj = i (h ~ Pi)dmj, where d O' represents pure heat exchange with 
the environment, minus contributions due to particle exchange. When written in 
terms of dQ', the first law takes the form 

N 

dU = d0! - PdV + J2 VH- (10.168) 

7=1 

We will now write Eq. (10.168) for densities. We again define s = S/V, 
u = U/V, and pj = ntj/V, but we also introduce the heat exchange per unit 
volume, dq' = dQ! /V . The Eq. (10.168) can be written 

V | du — dq' — hjdpj^j — dV (-“- p + EVlj- (10-169) 

However, u + P = H/V and YIjLi h’Pj — H/V. Therefore, the term on the right 
is identically zero and we obtain 

N 

du = dq' + ^2 h d Pj = dq, (10.170) 

7=1 

where dq is the total amount of heat transfered per unit volume. If we substitute 
Eq. (10.170) into Eq. (10.165), we can write the following expression for the 
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entropy change in our open system 


N 

Tds = dq — ^ Pjdpj. 
7=1 


(10.171) 


We can now use Eq. (10.171) to compute the entropy production in open 
systems. 

We will neglect convection of fluid in our small volume, V. That is, we 
assume that the baryocentric velocity of the fluid, v = YljLi Pj y j/p = 0, where 
p = YljLi Pj i s total mass density. If J; = pj\j is the mass flux (grams/ 
area second) of particles of type j, this gives the additional condition, 
YljLi J /' — 0- The time rate of change of thermodynamic quantities inside the 
volume are related through Eq. (10.171). Since v = 0, we can write 


ds dq _ dpi 

3rarX> 0T- 


(10.172) 


Changes inside the volume, V, are due to flows across the boundary and sources 
inside the volume. We can therefore introduce the following balance equations 
for the entropy density, 

ds 

-Ql = — V r • Js + <7, (10.173) 

the mass density of particles of type j. 


f =-V r -j; + E«. (> 0 - 174 ) 

a= 1 

and heat density, 

^ = -V r -J,. (10.175) 

In Eq. (10.173), a is the entropy source term and is the quantity we wish to find. 
In Eq. (10.174), J j is the mass flux of particles of type j across the boundaries, 
and i>j !a = where u j)0l is the stoichiometric coefficient for the y'th particle 

in the aih chemical reaction. The ath chemical reaction acts as a source (or 
sink) of particles of type j, and the “source” term, J c a = (l/V)(df; a /dt), 
represents this contribution (£ Q is the degree of reaction for the cdh chemical 
reaction). In Eq. (10.175), J q is the total heat flow across the boundaries. If we 
substitute these equations into Eq. (10.173), we find 


-v r 


J$ + (7 — — — V r 


N 

j,-E 

7=1 


p± 

T 




a= 1 


(10.176) 
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If we now rearrange terms we obtain 


-V r -J 5 + <T=-V r 


$q~ 52 j= 1 AjJ / 


+ S , • V r (i) V r (fj + £ •/' y , 

\ / j=l ' ' a=l 


(10.177) 


where A a = Vj,afc is the affinity of the ath chemical reaction. Thus, the 
entropy current is 


h = 


52j=l P-A 


(10.178) 


and the entropy source is 


a 


= J,-^G)-Ejrv r (f)-E^y 

V / j=l \ 1 / a=l 

= -J5-v r r-i 


r f=. 


(10.179) 


The entropy source term is again written as a product of a current and an affinity 
or force (the two words are often used interchangeably). Gradients in the 
temperature and chemical potentials of the various constituent particles act as 
forces driving the heat current, J q , and the particle mass current, J ; . The 
chemical affinity, A a , is the force driving the ath chemical reaction. 

It is sometimes useful to write Eq. (10.179) in a slightly different form. Let 
us note that <f(/^/r) = [dp.,] t /T+ [(d/dT)(fij/T )] Plm] . But [(d/dT)(p,j/T)} PAm} 
= -(l/T 2 )(p,j-SjT) = -lij/T 2 , whereij = (dS/dmj ) Tf , {m ^ , = -(dj},/dT),, {m] . 
We finally obtain d(p,j/T) = [dfij] T /T — (hT 2 )dT. This in turn leads to the rela- 
tion Vrifij/T) = [V r Pj] t /T — ( hj/T 2 )V r T . If we substitute into Eq. (10.179), 
we obtain 


Trj = — i j; • V r T - £ Jj • [V r £,] r - £ J c a A a , (10.180) 

1 1 a=l 

where J' q = J q - ^ . fyj) is the heat current minus the contribution due to 
diffusion. Notice that the entropy production is again written in terms of 
products of affinities (forces) and currents. The affinity driving the heat current 
is a temperature gradient. The affinity driving a mass current of particles of type 
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j is a gradient in the chemical potential of particles of type j. The affinity driving 
the orth nonequilibrium chemical reaction is just the chemical affinity, A a . 

It is interesting that even in a completely homogeneous system, chemical 
reactions out of chemical equilibrium create entropy. A chemical reaction is a 
spontaneous, random event which, if the system is out of chemical equilibrium, 
causes a discrete change in the chemical composition of the system. It is the 
prototype of a spontaneous process. As we show in Exercise 10.7, chemical 
reactions also provide a simple example of how the underlying microscopic 
reversibility of a system in equilibrium (the principle of detailed balance) leads 
to Onsager’s relations between cross-transport coefficients (cf. Section 10.D.1). 


■ EXERCISE 10.7. The cyclic set of chemical reactions 

k 1 h k} 

A^B, B^C, C^A, 

k-\ k. 2 Jk-3 

provides a “textbook” example of the use of Onsager’s relations. When the 
reactions are out of equilibrium they produce an amount of entropy/volume 
•time, o, where To = J X A X + J 2 A 2 A- hA 3 ,J a = {l/V)(d£ a /dt) is the 
current for the ath reaction, and A a is the affinity for the ath reaction. 
Assume that the chemical potential is that of an ideal gas mixture of 
molecules, A,B, and C. (a) Show that near equilibrium, J x —Jt ) = 
L\\A\ + L 12 A 2 and find L\\, L X 2 , Lai, L22 and J2 — J3 — La\A\ 4- L22A2. (b) 
Show that L \2 = La\ . 

Answer: The degree of reaction, d£ a , of the ath reaction (for a = 1,2, and 
3) is 



dn B 
vbi ’ 



VB2 


dn c 

VC2 ’ 


and d £ 3 = < ^~ 
VC3 



where u AX = v B2 = v c2) = — 1 and u BX = v C 2 = ^A 3 = +1. The time rate of 
change in the number of moles, n A , of molecule A due to reaction a = 1 
is dn A /dt = k- X n B — k x n A . The time rate of change of the number of 
moles of molecule B due to reaction a = 2 is dn B /dt = fc_2«c — h n B , and 
the time rate of change in the number of moles of molecule C due to reaction 
a = 3 is dnc/dt = k-zn A - k^itc- The affinities for reactions a = 1,2, and 
3 are 


Ai = n A - Hb, A 2 = Hb — Me, and A 3 = nc ~ Ha, (2) 

respectively, where nj = Mjflj. Since A3 = — (Ai + A2), the rate of entropy 
production per unit volume, a, due to the chemical reactions can be written 

To = ( J x — 7 3 )Ai + (J 2 — 7 3 )A2. (3) 
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i Therefore, the generalized Ohm’s law takes the form 

J\ — J3 = L\\A\ + L12A2 and J2 — J3 = Li\A\ 4 - £22^2- ( 4 ) 

| Now we must find explicit expressions for the Onsager coefficients, 
Ln,Li2,L2\, and L22. 

From the rate of change of mole numbers, we can write the currents as 

l 

J\ = k\c A - k-\c B , J 2 = k 2 c B - k_ 2 c c , and J 3 = k 3 c c - k- 3 c A , 

( 5 ) 

where Ci = rii/V is the molar concentration of molecules of type i. At 
j equilibrium these currents are zero. If we denote the equilibrium molar 
densities as c A ,c B , and cc, then we find the equilibrium conditions 


cb 

c A 


k - 1 


cc 

c B 


k2_ 

k-2 


, C A k 3 

and — = - — 
cc k- 3 


( 6 ) 


We can also write the affinities. Let us approximate the Gibbs free energy of 
the system by that of an ideal gas mixture of molecules A,B, and C, 


/y(5/2)\ 

G(P,T,n A ,n B ,n c ) =G^ — nRT In f — — j -f- n A RT\n(xj 
+ n B RT\n(x B ) + n c RT\n(x c ), 


( 7 ) 


where is a constant and x , is the mole fraction of molecules of type i. 
The chemical potentials are partial Gibbs free energies and are given by 


fi A = —RT In 
ji B = —RT In 

lie = —RT\n 
The affinities are therefore 

A\ = -RT\n 
A 2 = -RT In 
A 3 = -RT In 


T) 

J 1 5/2 


P 

y5/2 


- 1 -RT ln(x^), 
-f-^rin(;cfl), and 
-f-^rin(xc). 


( 8 ) 


©- 

(I)' 

©■ 


= -RT In 
and 


( £a\ 

\ ca ) ’ 


( 9 ) 
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We now want to approximate the currents and affinities by their values close 
to equilibrium. Let us expand the molar densities about their equilibrium 
values, c A = c A + a A ,c B = c B 4 a B , and c c = c c 4- etc , where a A ,a B , and 
ac are small deviations from equilibrium. Then we find 


r a 

J i = A i , 

RT ’ 

Thus, we finally obtain 


, - kl CB a 

2 RT A2 ’ 


and h =^£t(Ai +A 2 ). 


RT 


A ~h = 


k\c A 4- k^cc 

RT 


Ai + 


h c c 

RT 


M 


( 10 ) 


( 11 ) 


and 

j 2 - h = h££ Al+ k * ° ±M£ A2 . (12) 

RT RT y J 

We therefore find L u = (l/RT)(kic A + k 3 c c ),L l2 = Ln = ( hc c /RT ), and 
Ln = ( l/RT)(k 2 C B + /c 3 c c ) so that Onsager’s relations are satisfied for this 
cyclic set of chemical reactions. 


10.F.2. Fick’s Law for Diffusion 

In order to develop some intuition about these equations it is useful to consider 
Brownian motion. Let us imagine that we have a very dilute mixture of a 
solvent (which we will choose to be water) and a solute (which we will leave 
variable). Let p w (p s ) be the density of water (solute) and let p = p s + p w be the 
total density. We will assume that the mass density of solute is much smaller 
than the mass density of water, so that x = (p s /p w ) < 1- Also, we shall assume 
that the pressure, P, and temperature, T, are uniform throughout the fluid. From 
Eq. (10.180) the entropy production, a, is given by 


T(7 — Jyv " Jf ' [V tPs\t‘ (10.181) 

From the Gibbs-Duhem equation (2.62), for constant temperature and pressure, 
we find Ps[Vr/2j] r p + p w [V r p w ]T,p = 0- Also, J, + J w = 0 since the baryo- 
centric velocity is assumed to be zero. Therefore, the entropy production can be 
written 


To = - (j, - *) • [V r /iJ r ,f>. (10.182) 

The current that appears in Eq. (10.182) is the diffusion current, 
Jf = J, - (ps/pw) Jw = Jj(l 4- ps/Pw ), of solute relative to water. This is easy 
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to see. Since J 5 = p s \ s is the current of solute relative to the walls and 
j w = p w \ w is the current of water relative to the walls, jf = J* - ( p s /p w ), 
= p s (v s — Vh,) is the current of solute relative to water. It is this current that 
must be nonzero to have diffusion of solute relative to water. In terms of the 
diffusion current, the entropy production takes the form 

a = -j? ■ iy r fi,] TiP . (10.183) 

We can write a generalized Ohm’s law for this process. Since for a binary 
mixture, p s = p s (T,P,x s ), where x s is the mole fraction of solute, we have 

j? = -L s [V,jl,\ Tr = -L s {^\ V 'X S . (10.184) 

If we treat the mixture as an ideal mixture, then p s = A* + {RT /M s ) ln(*,) and 
(dfi s /dx s )p T = RT/x s M s . Also note that V r x s = {c w /c 2 )Vc s - {c s /c 2 )Vc w , 
where c, = n,-/V is the concentration (moles/volume) of molecules of type i. 
However, since c s <C c w , to lowest order in c s /c w , we can write the particle flux, 
jf (moles/area • second), as 

tD RT 

} ’=wr- L ’M^ s v ' c ^- DV ' c - (iai85) 

The equation jf = — DV r c s is Fick’s law for diffusion, and D = L s RT/M]c s is 
the diffusion coefficient. If we equate the diffusion coefficient, D, to the 
Einstein diffusion coefficient for Brownian motion, D = RT/ (N A 6nr)a), then 
we find L s = c s Mj /N A ^rja, where rj is the coefficient of viscosity for water, a 
is the radius of the solute molecule, and N A is Avogadro’s number. We can 
obtain a numerical estimate for these quantities. At T = 25 °C = 298.15 K, the 
viscosity of water is rj « 0.9 x 10~ 3 P • sec (Pascal-seconds). For solute 
molecules with a radius of 4A, the Einstein diffusion coefficient is 
D « 0.61 x 10 -5 cm 2 /sec. We can compare this with actual measurements of 
the diffusion coefficient, given in Table 10.1, for very dilute aqueous solutions. 
The value given by the Einstein diffusion coefficient is of the correct order of 
magnitude. It is also interesting to see how the diffusion coefficient depends on 


TABLE 10.1. Diffusion Coefficients for Some Typical Substances in the 
Limit of Infinitely Dilute Aqueous Solution ( T = 25°C) [14] 


Solute 

Formula 

D (10 5 cm 2 sec ') 

Acetone 

c 3 h 6 o 

1.28 

Benzene 

c 6 h 6 

1.02 

Ethanol 

c 2 h 6 o 

1.24 

Methane 

ch 4 

1.49 

Ethylbenzene 

C 8 H,o 

0.8 
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TABLE 10.2. Diffusion Coefficient, D, of Benzene in Some typical Solvents in the 
Limit of Infinitely Dilute Solution ( T = 25°C). 77 is the Viscosity of the Solvent [14] 


Solvent 

Formula 

D (10 5 cm 2 sec J ) 

7/(10 3 Pa-sec) 

1 -Butanol 

C 4 H 10 O 

1.00 

2.544 

Cyclohexane 

c 6 h 12 

1.41 

0.894 

Ethanol 

c 2 h 6 o 

1.81 

1.074 

Heptane 

C 7 Hi 6 

3.91 

0.387 

Toluene 

c 7 h 8 

2.54 

0.560 


the solvent. In Table 10.2 we give the diffusion coefficient for some infinitely 
dilute solutions of benzene in various solvents. 


10.F.3. Thermal Diffusion 


If a temperature gradient exists in a mixture of molecules, it can drive diffusion 
processes. Also, if a concentration gradient exists in a mixture, it can induce a 
temperature gradient. Diffusion induced by a temperature gradient is called 
thermal diffusion. The parameter that measures its strength is called the Soret 
coefficient. 

Let us again consider a binary mixture consisting of water (the solvent) and a 
solute. We shall assume that the pressure, P, is uniform throughout the mixture. 
From Eq. (10.180) the entropy production, a, is given by 



K 

K 


' V r T Js • [Vr/ij]^ Jw • [V r 
■ V r T — • [Vr/ijjy- p, 


(10.186) 


where jf is the mass current of solute relative to water. The generalized Ohm’s 
law for this system can be written 


— — r T , 

J q = ~ r T. 


(10.187) 


where L ss ,L sq ,L sq , and L qq are the transport coefficients associated with this 
process. Let us again assume that we have a dilute mixture of solute in water so 
that [V r £*]r,p ~ ( dp, s /dc s ) TP V r c s . Then Eqs. (10.187) take the form 
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The diffusion coefficient is then D = (L ss /M s ) (Off/ dc s ) Tp . The coefficient of 
thermal conductivity is K = L qq . In experiments it is found that the coefficient 
l sq is linearly proportional to the solute concentration. Therefore, a coefficient 
of thermal diffusion , D T , is introduced and is defined so L sq = M s c s D T . The 
solute particle flux, jf — jf /M s , then takes the form 

jf = -DW r c s - c s D T V r T. (10.189) 

The Soret coefficient , St, is defined as St = D T /D = L sq /{M s c s D). The Soret 
coefficient is a measure of the concentration gradient that is set up as the result 
of a temperature gradient for a system in its steady state. The steady state is 
defined so that jf = 0 and V r T =constant. Therefore, if we set jf = 0, the 
Soret coefficient can also be written St = — V r ln(cj)/V r T. The effect of 
thermal diffusion is generally small with D r «10“ 2 D or D J « 1 0 — 3 Z>. 
However, it has been used as a basis to separate isotopes (Kr 84 from Kr 86 and 
HC1 35 from HC1 37 ) in the gaseous phase [13]. 

10.F.4. Electrical Conductivity and Diffusion in Fluids [12] 

Electrical conductivity of charged particles in a fluid and the diffusion of those 
particles are closely related phenomena, because in both cases the process is 
governed by how the particles interact with the medium around them. Let us 
imagine a fluid consisting of a dilute solution of charged solute particles (ions), 
with charge z s e, in water. We can place electrodes in the solution and create an 
electric field, E = — V r 0, in the fluid. There may also be a gradient in the 
concentration solute particles. Under such conditions there will be a transport of 
solute particles from one region of the fluid to another. For simplicity we shall 
assume the temperature, T, and pressure, P, are uniform thoughout the fluid. 
However, we shall assume that there are gradients in the chemical potentials of 
the ions and the water and an electric potential gradient in the fluid. The entropy 
production, a, is given by 

Ta = -J s • V r $ - J w • Vr/iw, (10.190) 

where ft e s is the electrochemical potential of the ions. From the Gibbs-Duheim 
equation we know that for constant T and P, p s S7 r p e s + p w V r fl w = 0. Therefore, 
we can rewrite the entropy production as 

To = — Jf • V r g, (10.191) 

where jf = J 5 - (p s /p w )3 w is the ion current (relative to the water) induced by 
the electrochemical potential gradient. For a very dilute solution, p s p w and 
J.v ~ J 5 . The generalized Ohm’s law for this system can be written 
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Equation (10.192) contains information about both diffusion and electric 
conduction processes. 

If no electric potential gradient exists in the fluid and if the solution is 
sufficiently dilute that the interaction between ions can be neglected, then the 
ions diffuse in a fairly normal manner. The generalized Ohm’s law takes the 
form 


J? = -L—~V r c s = -DV r c s , (10.193) 

AigCg 

and the diffusion coefficient is D = LRT/M s c s , where M s is the molecular 
weight of the ion (grams/mole) and c s is its concentration (moles/volume). 

Let us now consider the case in which the concentration of ions and water are 
uniform, but a constant electric field is imposed across the fluid. Then the ion 
current takes the form 


Jf = -£V r # = ~L jf V r 0 = L ^ E. (10.194) 

where F is the Faraday constant (cf. Exercise 2.7). 

In the limit of very dilute concentration of ions, jf « J*, where J 5 gives the 
ion flux (moles/area ■ time). The electric current, /, carried by these ions is 

/ = ZsFjf = L E. (10.195) 

The electrical conductivity of the fluid is k = I /E. Therefore, 

_2 p 2 ~ -2 p 2 

K = L \r = ^—D = z s c s \, (10.196) 

Ml § t\ 1 

where A is the equivalent conductance. Thus we have related the diffusion 
coefficient for an ion to its electrical conductivity. In Table 10.3 we give the 


TABLE 10.3. The Diffusion Coefficient, D, and Equivalent Conductance, A, for 
Some Ions in the Limit of Infinitely Dilute Aqueous Solution ( T = 25° O [14] 


Ion 

£>(10 5 cm 2 

H+ 

9.311 

K+ 

1.957 

Na+ 

1.334 

cr 

2.032 

P- 

1.475 


! ) A(10 _4 m 2 Smol _1 ) 


349.65 

73.48 

50.08 

76.31 

55.4 
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diffusion coefficients for some commonly found ions in the limit of infinitely 
dilute aqueous solution. 


I g EXERCISE 10.8. Consider a very dilute solution of salt, which we 
| denote A~. C+ , which dissociates as 

i/ a *'c 

! A Va C t ^ VaA ~ + ^ C+ ’ 

where v a and v c are the stoichiometric coefficients for the dissociation. 
| Compute the diffusion coefficient for the salt in terms of the diffusion 
j coefficients of the ions in the limit of an infinitely dilute solution. (Assume 
j uniform temperature and pressure throughout the system). 

; Answer: Let us assume that there are no external electric fields present, so 
I the mass current is driven only by gradients in the chemical potentials of the 
i ions. The entropy production, a, is given by 

i 

I To — J a • V £i a Jc ‘ V Me J w ' V Mw ^ (1) 

I 

I 

where J is the mass flux of particle of type i(i = a,c,w). From the Gibbs- 
| Duhem equation we know that for constant T and P,p fl V r M fl 4- p c V r M c + 
PwVrMw = 0. Therefore, we can rewrite the entropy production as 

I 

! Ta = -3° ■ Vr/i. - jf • V r /i c , (2) 

j where jf = J j — {pj /p w )iw{i = a, c) is the diffusion mass current of ion i. In 
| the limit of infinitely dilute solutions, we can neglect the effect contributions 
1 to the current of cations due to gradients in the chemical potential of anions, 

| and vice versa. Therefore, in this limit the generalized Ohm’s law for this 
l system can be written in the simple form 

I 

J a = -L a VrM a and J? = -LcVrMc- (3) 

I 

( 

We can restrict ourselves to the case when the total electric current is zero so 
that 

; Zj? + ZcJ? = ZaS D JM a + Zcif /M c = L a V t p a - = o, 

(4) 

where J ,(i = a, c) is the flux of ions of type i. From Eq. (4) we obtain 
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The condition for chemical equilibrium of the dissociation of salt into ions is 
given by 


M. S ^J ffl s — l^a^a^rfJ'a “I” r/-^c • 

If we combine Eqs. (5) and (6), we can write 

/ \ 
ZaLaMc 


V r /ic — 


VcM^ZaLa VqM^ZcLc 


r As> 


(6) 


(7) 


and 


^ r P'a — 


z c L c M a 


VaM^ZcLc Za^a 

The particle flux of salt molecules is given by 




( 8 ) 




(9) 


We can now substitute Eqs. (7) and (8) into Eq. (9) to compute the particle 
flux of salt molecules. Let us first note that L a = D a M*c a /RT and 
L c = D c M^c c /RT, where D t is the diffusion coefficient for ions of type i. 
Let us also note that c a — i/ a c s , c c = u c c s , and V r fi s = (RT /M s c s )\/ r c s . If 
we make use of these relations, we find J 5 = — D s V r c s , where 


D r = 


( Zc + \Zq\)D a Dc 
Z C D C + \Za\D a 


(10) 


is the diffusion coefficient for a very dilute aqueous solution of salt 
molecules. 


► SPECIAL TOPICS 

► SI O.A. Onsager's Relations When a Magnetic Field is Present [3] 

When a magnetic field, B, is present or when we consider a system in a rotating 
container where Coriolis forces are present, the Onsager relations must be 
generalized. When we reverse the time and velocities in the equations of 
motion, we must also reverse the magnetic field, B, in order to keep the 
equations of motion invariant, since the magnetic force on particles with 
velocity v and charge q is F = q\ x B. If we have a rotating container, we must 
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reverse both the velocity of the particles and the angular velocity co of the 
container to keep the Coriolis force F = 2 mv x to from changing sign. 

In the presence of a magnetic field and fluctuations (3 (the fluctuations (3 
change sign under time reversal), the distribution function must satisfy the 
relation 


=/(«,-£, -B) ( 10 - 197 ) 

under time reversal [here a = (ai , a 2 , • • • , a n ) and p = (fo , . . . , /3 m )]. This is 
easily understood if we write /(a, /?, B) in terms of microscopic states: 

/(a ’* B)= dwf'K dp "- (10198) 

E—*E+AE 


For both classical and quantum systems, energy is an even function of the 
momentum and the magnetic field. Therefore, if we reverse time and the 
magnetic field in Eq. (10.198), the integral does not change and Eq. (10.197) 
follows immediately. 

When a magnetic field and fluctuations (3 are present, we can write the 
distribution function in the more general form: 


/(a, fi, B) = Cexp ( - — (g : aa + m : a/J + n : fit* + h : fifi) 
Z/Cb 


(10.199) 


where C is the normalization constant and the tensors m and n can be 
represented bymxn rectangular matrices. The tensors g, m, n, and h may now 
depend on the magnetic field. In order to satisfy Eq. (10.197), they must have 
the following behavior when the magnetic field is reversed: 

g(B) = g(— B), h(B) = h(— B) (10.200) 

and 

m(B) = -m(-B), n(B) = -n(-B). (10.201) 

Note that when B = 0 we have m = n = 0. The matrix rh is simply the 

transpose of n: 


m = n r . (10.202) 

The change in the entropy due to fluctuations in the presence of a magnetic field 
is 


AS = — \ (g : aa + m : a.p + n : ptx, + h : pp) 


(10.203) 
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and we must introduce two generalized forces: 

X = -^^- = g-& + \p-fa + \ii- p = g-a + n /? (10.204) 

and 

Y = — = h ■ /? + m • a. (10.205) 

[Note that in Eqs. (10.204) and (10.205) we have used Eq. (10.202)]. The 
various time-independent correlation functions take the form 


(owe) = k B (g — n • h" 1 -m) -1 , 

(10.206) 

(a P) = -k B ( g - n h _1 m) _1 n h 1 , 

(10.207) 

(P<x) = -k B ( h - m g _1 n) -1 m g -1 , 

(10.208) 

(PP) =k B ( h-m g 1 • n) _1 . 

(10.209) 


In the presence of a magnetic field, Eq. (10.84) is generalized to the form 

/’(qVVVV.B) = p(q'Vp"V-p'V,-B), (10.210) 

and the joint probability density for fluctuations is generalized to the form 

/(a, /}, B)P(a, , /?', r, B) = /(a', -B)P(a', -ft, r, -B). 

(10.211) 


From Eq. (10.210) it follows that the time-dependent correlation functions 
given by expressions analogous to Eq. (10.76) satisfy the following relations: 


(aa(r);B) = (a(r)a;-B), (10.212) 

( a /H r );B) = — (a(r)/f; — B), (10.213) 

(/1«(t);B) = -</((t)«;-B), (10.214) 


= (10.215) 


under time reversal. 
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The equations describing the regression of fluctuations can be written as in 


Eq. (10.87) except that now the matrix M depends on the magnetic field. Thus, 


-l 

dt 1 

M(B) (<mw’ 

(10.216) 

where 


ug _ ( M“(B) M^fB) \ 

\M^*(B) M^(B ) ) 

(10.217) 

or 

d , 

dt 

Iwowj- L(B) l(V(0),J- 

(10.218) 

If we use arguments 

similar to those in Section 10.D, Onsager’s 

relations take 

the form 


L a< *(— B) 2 " = L“*(B), 

(10.219) 



L^*(— B) r = -L a/, (B), 

(10.220) 



L^(-B) r = — L^ a (B), 

(10.221) 

and 




' 


L^(-B) r = B). 

(10.222) 


Equations (10.219)-(10.222) give the general form of Onsager’s relations. If we 
were considering a rotating system, then we would replace the magnetic field, 
B, by the angular velocity, u>. 


► S10.B. Microscopic Linear Response Theory [15] 

Kubo was the first to show that it is possible to derive the linear response matrix 
directly from microscopic theory. Let us consider a system to which we apply 
an external field, Fj(r, t), which couples to microscopic densities, %(r). The 
total Hamiltonian of the system in the presence of the external field is 

H(t)=H 0 + AH(t), (10.223) 

where H 0 is the Hamiltonian of the system in the absence of the field and 


A H{t) = 


dra(r) • F(r, f) 


(10.224) 
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is the contribution to the Hamiltonian due to the external field. We assume that 
the external field is turned on at time t = — oo. The total density operator, p(t), 
in the presence of the field satisfies the equation of motion 

(10.225) 

Let us write the total density operator in the form 

p{t) = peq + Ap(t), (10.226) 

where Ap(t) is the change in the state of the system due to the perturbation and 
Peq = e~P H °/Tr [e~^ H °] is the equilibrium density operator. Since [Ho, Peq) = 0, 
we find 

1*^ = [Atf «, M + [Ho, A0(O1 + [AH(r), Aflf)]. (10.227) 

To obtain an expression for Ap(t) which is linear in the applied field, we will 
neglect the nonlinear term, [Atf(f), Ap(t)], and write 

m ^r = ^ + [ *°’ (10 - 228) 

It is now possible to solve Eq. (10.228) for A p(t). 

The solution to Eq. (10.228) is straightforward. First write 

A p{t) = e-^A pi{t)e iflot/H 

and plug this expression into Eq. (10.228). Then the equation for A pi{t) is 

m d - -Q t ^ = [e ifiot/h AH{t)e- ifiot/h , p eq ). (10.229) 

We can integrate Eq. (10.229). If we assume Ap/(— oo) = 0, we find 

A P/(t)=4 ^ dt '[ eik ^ /h A H(t')e~ iflot, / h ,p eq ]. (10.230) 

^ J— OO 

The equation for Ap(t) is then 

A/5«=i|' dt'[e- ik ^- ,) l h ^H(t’)e ,k ^l h ,p eq \. (10.231) 

^ J— OO 

We can use Eqs. (10.226) and (10.231) to find the average value of any desired 
quantity. 
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Let us first compute the average value of the operator, a, (r), at time t. Let us 
assume that Tr[di(r)p eq \ = 0, although this not always the case. Then we find 

r, 0) = Tr [fl.-(r)Ap(f)] = - ^ dt' | dr' ([a,(r, t), dj{ r', f')]) c /)(r', 0, 

(10.232) 

where 

{[d i {r,t),d j {r\t , )\) eq = Tr(p^[a,(r, t),a/(r', f')]) (10.233) 

and 

a,(r, t) = r)e-‘"»'/ e . (10.234) 

To obtain Eq. (10.232), we have cyclically permuted operators under the trace 
and have assumed that («,•( r)) = 0. If we compare Eqs. (10.133) and (10.232), 
we can write the response as 

(di( r, 0 ) = 2i ^ [ Jr' (r, r'; t - t')Fj(r\ f'), (10.235) 

— oo J 

where we use the notation of Section (10.E) and 

r '; t - 0 = ^ <[«<(**, f')]) c<r (10.236) 

Thus, from microscopic linear response theory we obtain the result that the 
dissipative part of the response matrix is given by the equilibrium average of the 
commutator of the relevant operators at different times and positions in space. 

We can also obtain the fluctuation-dissipation theorem from microscopic 
linear response theory. The position-dependent correlation function in a 
translationally invariant stationary system can be written 

<W r - I*'; r) = (fi f (r, r)aj( r')) eq , (10.237) 

where we have assumed that ( di(r)) eq = 0. Write the Fourier transform is 

G a h aj{ k, r) = d( r - r')e~ , ' k (r ” r/) (a,-(r, r )dj{ r’)) eq 
= i Jr Jr'<T lk (r_r ' ) (a,-(r, r)a y (r')) e<? 

= ^ (flf(-k,r)fly(k))^ = i (a*(k,r)fl y (k)) er 


(10.238) 
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In Eq. (10.238), we have used the definition, dj( k) = J dre +ik T dj( r). From Eq. 
(10.236), the momentum-dependent response matrix can be written 


^(k.T) = Ti([a*(k,r),axk)]) c ,. 

(10.239) 

However, 


{aMr)) eq = <«/( r - iWY^eq = e~ mid/dT) {di{T)aj) eq , 

(10.240) 

where di{—ih,(3) = e^°die~^° . Therefore 


C,(k.r) = L (1 - e -W a/ ^)G a „ 0 / (k,r). 

(10.241) 


The Fourier transform of Eq. (10.241) takes the form 

X«„„( k > w ) = T (1 - e~^)S aiMl ( k,w), (10.242) 


where 5 a . )aj (k,o;) is the spectral density matrix. Equations (10.240) and 
(10.242) were first derived by Kubo [15] and are the form of the fluctuation- 
dissipation theorem that results from microscopic linear response theory. We 
see that the imaginary part of the response matrix is expressed directly in terms 
of the equilibrium correlation function. The dynamic susceptibility can be 
expressed directly in terms of S ahaj (k, u) and is given by 


Xa h aj{Ku) = lim 
T ]-*0 


'du>' (l-e-^X^kX) 
27 rh uj' — u — irj 


(10.243) 


where we have made use of the Kramers-Kronig relations. 


► S10.C. Light Scattering [16-19] 

The use of light scattering as a probe of the structure of matter began a little 
more than 100 years ago. The incident light polarizes a medium, and the 
medium then reemits the light in a manner which depends on the dynamics of 
the medium. Light scattering as a tool to study the internal dynamics of systems 
has developed in three stages. From the late 1800s until 1928, light scattering 
experiments involved measurement of the intensity of scattered light as a 
function of angle with respect to the incident direction. However, in 1928, 
Raman measured the spectrum of light inelastically scattered from gases, and he 
found frequency shifts in the scattered light due to internal degrees of freedom 
of the molecules. From then on, most light scattering experiments focused on 
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observation of the spectrum of inelastically scattered light from various media. 
However, such experiments depended on conventional light sources and with 
conventional light sources, it is impossible to obtain monochromatic light. One 
of the sharpest conventional light sources is the 6438 A line of cadmium. But it 
has a line width of order 6 x 10 8 Hz which swamps the details of the spectrum 
of the inelastically scattered light. Also, conventional light sources are very 
weak so experiments often last 10-12 hours. A revolution in the subject 
occurred in the early 1960s with the development of lasers. The neon-helium 
laser had a monochromatic light source with a line width of order 1 Hz and a 
very high power. Light scattering experiments can now measure dynamical 
events on time scales ranging from 10 _1 sec to 10“ 15 sec. 

The equilibrium fluctuations in a fluid can be probed with light or with 
neutrons. Since equilibrium fluctuations in a fluid decay, on the average, 
according to the laws of hydrodynamics, we should be able to obtain 
information about transport coefficients simply by scattering light or particles 
off of fluctuations in a fluid at equilibrium. The fact that this could be done was 
established by Landau and Placzek [20] and by van Hove [21]. In 1934, Landau 
and Placzek showed that it is possible to obtain the time dependence of the 
density autocorrelation function for a fluid from the hydrodynamic equations. 
Later, van Hove, in a very important paper, was able to relate the differential 
cross section for scattering of neutrons from a fluid to the density 
autocorrelation function of the fluid, thus establishing the connection. In this 
section we shall obtain an expression for the intensity of light scattered from a 
fluid in terms of the hydrodynamic modes. 

A light wave incident on a simple fluid induces oscillating dipoles in the 
particles of the fluid. These oscillating dipoles reemit spherical light waves. If 
the medium is homogeneous, all scattered light waves cancel except those in the 
forward direction. However, if density fluctuations exist in the fluid, light will 
be scattered in directions other than the forward direction. 

Density fluctuations are both thermal and mechanical in origin. Thermal 
density fluctuations result from damped temperature or entropy waves, while 
mechanical density fluctuations result from sound waves in the fluid. Light 
which scatters from density fluctuations will be unshifted in frequency, while 
light which scatters from the sound waves will undergo a frequency shift 
(Doppler shift). 

In general, there will be sound waves with a wide range of wavevectors 
ar >d frequencies in the fluid. However, for a given scattering angle, 6, only 
elected sound modes will contribute to the scattering. Light will scatter from 
the wave-fronts of the sound waves and must satisfy the Bragg condition 
(cf- Fig. 10.3) 


2\ s sin - = A 0 


(10.244) 


where \ s is the wavelength of the sound and Ao is the wavelength of the incident 
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Fig. 10.3. Light scattered from a sound wavefront. 


light. The wavevector of the light wave will be shifted by an amount 

Q 

A k = 2 k 0 sin - , (10.245) 

where k Q = (2 tt/X 0 ). The frequency of the scattered light will be Doppler- 
shifted by 

O, — uj — uj q = ± ^ u; ° Vs sin ^ = ±v s Ak, (10.246) 

where cj 0 — ck 0 , v s is the speed of sound in the fluid, and c is the speed of light. 

We will now derive these results. We will first find an expression relating the 
intensity of scattered light to the correlation function for density fluctuations in 
the fluid. Then we will find an expression for the density correlation function 
from the hydrodynamic equations. 

► S10.C.1. Scattered Electric Field [22, 23] 

We wish to find an expression for the intensity of light scattered from a simple 
fluid of identical particles. We shall assume that a monochromatic light wave, 

E(r, t) = E 0 < ? , '( k o r -"°'), (10.247) 

impinges on the fluid, and that the fluid particles have a polarizability a. The 
induced polarization gives rise to a dipole moment density, 

P(r, t) = aE(r, /) ^ ^(r - q ,-(/)), 

(=i 


(10.248) 
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(q.(r) is the position of the ith particle at time t ), which enables the external 
field to couple to the particles in the fluid. In writing in Eq. (10.248) we have 
neglected multiple scattering effects because we are assuming the polarization 
is induced entirely by the external field. 

The easiest way to obtain the amplitude of the scattered light is to introduce 
the Hertz potential Z(r, t). This is done in the following way. The electric field 
E(r, t) is defined in terms of the usual scalar potential 4>(r,t) and vector 
potential A(r, t) as 

E = -V r <f>--^. (10.249) 

In the Lorentz gauge, the scalar and vector potentials satisfy the equations 

d ^ A /Tp 

VjA - sp = -pJ = -ep — , (10.250) 

where p is the permeability of the medium and e is the permitivity of the 
medium, and 


V^-£/ i ^ = -f=V r -P, (10.251) 

where J is the electric current and p is the electric charge density. For the 
system we are considering, the source of electric current and electric charge is 
the dipole moment density, P(r, t). We now introduce the Hertz potential, 
Z(r, t) [22] by means of the equations 

A = £ M — (10.252) 

and 

(f> = -V r • Z. (10.253) 

Comparing Eqs. (10.250)-( 10.253) we find that Z(r, t) satisfies the following 
equation: 


V\Z-e^ = -V. (10.254) 

Thus, we have an equation of motion for the Hertz potential in which the 
polarization acts as a source. 

Given Eq. (10.254) we now find the Hertz potential for the scattered light 
wave. The retarded solution to Eq. (10.254) is given by 


Z( 









(10.255) 
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(cf. Ref. 23). We next introduce the Fourier expansion of the delta function and 
obtain 


Z (M) =\ 




(10.256) 


for the outgoing wave. The outgoing electric field is related to Z(r, t ) through 
the expression 


E'(r, t) = V r (V r • Z) — 


d 2 Z 
dt 2 


(10.257) 


[cf. Eqs. (10.249), (10.251), (10.252)]. 

We only need an expression for the scattered light far from the source. If we 
remember that r / is restricted to the region of the dipoles and r is the coordinate 
of the observer, we can make the following approximation: 

|r — r'| «r — r •!•'■ + •••, (10.258) 

where r is the unit vector r = r/|r|. If we substitute Eqs. (10.248) and (10.258) 
into Eq. (10.256), we obtain 




p poo poo N 

dr' di dJe ^ <5( r' - q,(0) 
J J — oo J — OO ,_J 

/ 


x exp 




(10.259) 


We next substitute Eq. (10.259) into Eq. (10.257) and neglect terms of order 
/ fr and smaller. We then obtain the following expression for the scattered 
electric field: 

E ' (r ’ ,} =\ {£) £ (e ° - ?( * ■ Eo)) J dT ' L d{ L dJJ1 

x exp{/(ko ■ r' - c Jot 1 )} exp t + 

(10.260) 


N 


"(r'.Os 


i=i 


(10.261) 


where 
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is the microscopic density of particles in the medium and we remember that 
c z z= £ 0 fio in free space. 


^ S10.C.2. Intensity of Scattered Light 

The spectral intensity of scattered light is defined as [cf. Eq. (5.87)] 


7(r, cj) = lim — 

T—>oc 11 


■E'(r,o) ; r) ■H!’(r,ur,T), 


where 


E'(r,w;r) 


• oo 

dte iut E'{r,t)9{T - |/|) 

— OO 


(10.262) 


(10.263) 


and 0(T — |/|) is a Heaviside function. If we assume the system is ergodic, we 
can combine Eqs. (10.262) and (10.263) to obtain 


/ (r, uj) = lim ~ ,/^- 
v ' 2T V Mo 


dt 


dt' 


x *M»-0 E '(r ,/) • E '*(r,t')0(T - \t\)6(T - |r'|) 


dre iulT lim - 
T—>oc T 


<M'E'(r,r' + r)-E'*(r,0 


-T 


= I dr^(E'(r,r)-E'*(r,0)). 

Mo J-c 


(10.264) 

The average in Eq. (10.264) is taken with respect to a stationary equilibrium 
ensemble. If we now substitute Eq. (10.260) into Eq. (10.264), we obtain 



x <„( iVMrV')), 


(10.265) 
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where we have let r ■ E o cos0. We next integrate over r and J . This gives 
us 


T ( \ ^ o 


i6 v sin2( 4 r ' J L 4. L dJ ' 

X e i(k 0 r'-^Og-«(kor"W) exp + £ - 1 r • r'j J 

X 


J a tjj 2 


exp | -iu/' ^ - i r • r"^ | (n( r', r')n(r", 4)- 


(10.266) 


We now make the change of variables t = t' — t" and note that 
(n( r 7 , t / )n(r // , t")) can only depend on the difference r 1 = t' — t". We then can 
write Eq. (10.266) in the following form: 


Tt , 1 fa 

/(r ’“ ) = 8\/^ 


a \ 


w) £ ° sin {4>) 


dr' 


dr" 


dTUJ A e~ l(u *-^ T 


x exp ko - / ^ r) • (r' - r")}(n(r',r)n(r")); 


(10.267) 


or with the change of variables p = r' — r" we find 


it \ - 1 

F ’ ^ 8 y po V27 rrc 2 


^) 2 ^si„ 2 (0) J^J^'L/™ 4 

x e - , ( w °- w ) T exp {/(ko-/^f) • p j (n(r" + p, r)n( r")). 


(10.268) 


We next introduce the dynamic density correlation function 


C(p,r) J dr'(n(r' + p,r)n(r',0)), (10.269) 


where A'’ is the number of particles in the fluid, and the spectral density function 
(dynamic structure factor), 


S( k,Q) 


J *oo 

dP .- 0 


drC(p , T)e^e-^ r 


(10.270) 
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In terms of these quantities, the intensity may be written 

I(r,uj) = i (2^27) Nsin 2 ( <t > ) J dp j drC(p,T) 

x exp {i(ko -yf)-pj (10.271) 

where Iq = \ yj {eojpo) £?, k = ko - (wo/c) r and = u 0 - u. 

An expression for the spectral density function, S nn (k, ft), for an isotropic 
fluid was obtained in Exercise 10.4, Eq. (7). If we use that result, we find that 
the intensity of scattered light for an isotropic fluid can be written 

/( r, u) = ^ W ( 2 ^ 27 ) sin2 W) 1 27r N 2 6(Q)6 kt0 
+ ^2 (»k(0)n_ k (0)) 

I f r ^ 2 r * 2 

+ 7\(fi- c 0 it ) 2 + r 2 * 4 + (ft + c 0 it ) 2 + r 2 * 4 

where 7 = (c P /c p ), x is the thermal diffusivity, T = \ [vi + x(7 — 1)], and vi is 
the longitudinal kinetic viscosity. In a typical light scattering experiment for a 
simple liquid, c 0 « 10 5 cm/sec, k « 10 5 cm _1 , x^/ c o ~ 10“ 2 — 10~ 3 , and 
^k/c 0 « 10" 2 - 10~ 3 . 

Thus we see that the scattered light spectral intensity will have peaks at 
frequencies ft = cj 0 and ft = ujq ± c 0 k. As we discussed in Exercise 10.4, the 
central peak (ft = u> 0 ) is due to scattering off of thermal density fluctuations. It 
is called the Rayleigh peak. The two side peaks (ft = uio ± cok) are due to 
scattering of light off of mechanical density fluctuations (sound waves) in the 
fluid. They are called Brillioun peaks. 

The ratio of the intensity, l t h, of light scattered from thermal density 
fluctuations to the intensity, I meC h, of light scattered off of mechanical density 
fluctuations is given by 



Ijh Cp - C p K T - Ks 
Imech Cp 


(10.273) 


where k t is the isothermal compressibility and k s is the adiabatic 
compressibility. Since c P — * 00 and k p — > 00 as we approach the critical point, 
we expect the Rayleigh peak to become very large compared to the Brillioun 
Peaks. In Fig. 10.4, we show experimental plots for the intensity of light 
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Fig. 10.4. Intensity of light scattered from 
C0 2 at an angle of 90° from the incident 
direction, shown as a function of frequency 
shift, 0. The C0 2 is 2.7 °C below the critical 
point, (a) Light scattered from C0 2 liquid, 
(b) Light scattered from C0 2 vapor. Re- 
printed, by permission, from H. Z. Cum- 
mins, Inter. School of Physics, “E. Fermi”, 
vol. XLII, edition R. Glauber Varenna, 
1967. 


scattered from C0 2 at an angle of 90° with respect to the incident direction. The 
C0 2 is at a temperature 2.7 °C below the critical temperature. Note that the 
Rayleigh peak is very wide and high (not all of it is shown) compared to the 
Brillioun peaks. This is because the system is close to the critical point. 


► S10.D. Thermoelectricity [3, 12, 24, 25] 

In 1822, Seebeck found that if two different metals (copper and iron, for 
example) are joined to form a closed circuit and if the metal junctions are kept 
at different temperatures, an electric current will flow in the circuit. The 
presence of the current means that the temperature difference has created an 
electromotive force (EMC) in the circuit. This EMF is now known to be 
microscopic in origin. The electrons in the two different metals occupy different 
energy states. When the metals are placed in contact, electrons from the metal 
in which higher energy states are occupied will flow into the metal in which 
lower-energy states are occupied, thus creating a potential difference at the 
junction. This is called a contact potential and is a source of EMF for the 
circuit. For a closed circuit with two such junctions at the same temperature, 
these contact potentials just balance and no current flows. However, when the 
junctions have different temperatures, the occupied energy levels are changed, 
and the contact potentials in the two junctions become unbalanced allowing a 
current to flow in the circuit. The effect in metals is small. For example, a 
copper-iron junction gives rise to an EMF of about 1.6 x 10 -4 volts for each 
1 °C temperature difference between the junctions. Such a device is called a 
thermocouple. Because the effect is small, thermocouples made from metals are 
not useful for generating electricity, but they are useful in measuring 
temperatures or rapid temperature changes. 

In 1834, Peltier found another effect. If the metal junctions are kept at the 
same temperature, but a battery is connected to the circuit and an electric 
current is passed through circuit, one junction will become hotter (absorb 
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heat) and the other junction will become colder (emit heat). Finally, in 1854, 
Thomson (Lord Kelvin) found a third effect. He found that if a temperature 
gradient is set up along a wire, heat will be absorbed (or emitted depending on 
the direction of the electric current) along the length of the wire. Below we will 
analyze these effects and verify Onsager’s relations for this system. 

One may view a metal wire as an electrically neutral binary system 
consisting of positively charged metal ions at rest and negatively charged 
electrons which are free to move through the metal. (Although the carriers of 
current are negatively charged, we follow the usual convention of depicting the 
direction of electric current flow to be that of positive charges.) A flux of 
electrons can carry both a charge current, 7 = —FJ e \, where F is Faraday’s 
constant and J e \ is electron current (moles/area-time), and an entropy current, 
7 S , (or heat current 77s)- We will assume that the electrons have uniform spatial 
density since the system is electrically neutral and density of positive metal ions 
is fixed. Therefore, the electrochemical potential of the electrons can be written 
in the form n e A = fi e \(T) — F(j>, where is the chemical potential of the 

electrons, F is Faraday’s constant, and 4> is the electric potential. The entropy 
production, a, which results from the flow of electrons in the metal can be 
written 


To = -7 s vr - 7 e iV/4. (10.274) 

The generalized Ohm’s law takes the form 

h = -LssVT - Ls £ V/4, (10.275) 

7 e i = — LfsVT — (10.276) 

As we shall show below, Onsager’s relation, L$e = Fes, is well verified for this 
system. Let us now consider the the Peltier Effect, the Seebeck Effect, and the 
Thomson Effect in that order. 


► S10.D.1. The Peltier Effect 


If two distinct metals, A and B, are placed in contact and kept at uniform 
temperature, and an electric current is passed through the metal junction, an 
amount of heat, Q = nl, (Q has units Joule/area • time) will be absorbed or 
emitted (depending on the types of metal at the junction and the direction of 
current flow). The amount of heat absorbed per unit area per unit time per unit 
electric current is called the Peltier heat , 7f, and is written 


7T = 



vr= o 


(10.277) 


The electric current, /, will be continuous at the junction, but the entropy 
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Fig. 10.5. The Peltier effect, (a) If an electric current, /, flows through the junction of 
two metals, A and B, held at uniform temperature, Q = tt. I will be absorbed (or emitted), 
(b) A circuit at uniform temperature, consisting of two wires, one of metal A and the 
other of metal B, joined together. A battery is inserted between points a and d, and an 
electric current, /, is induced in the circuit. At one metal junction, an amount of heat / 
time, Q , is absorbed; and at the other metal function, — Q is absorbed (emitted). 


current will be discontinuous due to the absorption or emission of heat. As we 
show in Fig. 10.5. a, the relation between the entropy current in metal A and in 
metal B is given by 


77f = TJ* - irl. 


(10.278) 


It is useful to consider a specific example. In Fig. (10.5.b), if metal A is iron and 
B is copper then junction b cools and junction c becomes hotter. If the current is 
reversed, then junction c is cooled and junction b becomes hotter. The 
absorption or emission of heat depends on both the types of metal and the 
direction of the current. Therefore, the Peltier heat is not a Joule heat since 
Joule heat depends on the square of the current. Peltier heat is due to the contact 
potential in the junction. When an electron must “climb” a potential hill in 
crossing a junction, it will absorb energy, thus causing the junction to cool. 
When the electron “decends” a potential hill in crossing a junction, it will emit 
energy, thus causing the junction to heat. 

Let us now consider the circuit in Fig. (10.5.b) which contains two junctions 
between metals A and B (cf. Fig. 10.5b). We assume that the entire circuit is 
kept at temperature T. We connect a battery between points a and d and drive a 
current, /, in the circuit. If Eqs. (10.275) and (10.276) correctly describe the 
system, then the electrical current, /, will be accompanied by an entropy 
current, J$ (/f ), in metal A (B). We can define an entropy transport parameter , 
S*, for metal i, such that 



vt = o 



where i = A,B. 


(10.279) 
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S* is a measure of the amount of entropy carried along with the electron current 
when the temperature in the circuit is uniform. Measurement of the entropy 
transport parameter enables us to determine the ratio of phenomenological 
coefficients (L S e/Lee), for each metal. 

The Peltier heat, n, can be expressed in terms of the entropy transport 
parameters, and Sg. Since from Eq. (10.279) we have [7^] V7 . =0 = 
S* [7 e 'i] vr=o = — we can use Eq. (10.278) to obtain 

* = (io.28o) 

Thus the change in entropy carried by the electrons across the junction has been 
connected to the heat absorbed at the junction. 


^ SI O.D.2. The Seebeck Effect 

Two wires made of different metals, A and B, are connected to form a circuit, as 
for the Peltier effect. However, now the junctions are maintained at different 
temperatures, T, and T + AT, and a potentiometer is inserted at points a and d, 
rather than a battery. Also, the points a and d , and the potentiometer are all 
maintained at a common temperature, 7o, which lies between T and T + AT (cf. 
Fig. 10.6). For this arrangement there will be no electric current flow so / = 0. 
When AT 0, a potential difference, A(j>ad — 4> a — 4>d, is measured between 
points a and d. From Eq. (10.276), J e i = — ^£sV r r — Lee^tI^ = 0 so that 

[VrMeU^o = - v r r = - v r r = -s; v r r, (10.281) 

^EE 

where we have made use of Onsager’s relation, L‘ SE = L‘ ES . 



To 


Fig. 10.6. The Seebeck effect. A circuit consisting of two wires, one of metal A and the 
other of metal B, are joined together and the junctions are held at different temperatures, 
T and T + AT. A potentiometer is inserted between points a and d, and a potential 
difference, A <j>ad = 4>a — 4>d, is measured. 
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We can now obtain a relation between the temperature difference, A7\ and 
the potential difference, A = <f> a - (f>d, measured by the potentiometer. Let x 
denote the spatial coordinate along the wire. We can integrate Eq. (10.281) 
along the wire, but it must be done in segments. In Fig. (10.6) we have marked 
points a,b,c, and d. The change in electrochemical potential as we go from 
point d to point c is 


Similarly, 


( T+AT 

SgdT. 

To 


(^Mel)fcc ~ 

(^Mel )ab = 




T+AT 
To 


SldT , 


S£dT. 


(10.282) 

(10.283) 

(10.284) 


But the change in electrochemical potential from d to a is 


(A^el)*fa — ( A^ e l)c«i (^Mel )bc (^^el)ab 


rT+AT 


(s;-si)dr. 

T 


(10.285) 


Since points a and d are at the same temperature, we obtain (A/l*^)^ = 
— FA^ and 


A — (j) a (fid — p 


rT+AT 

T 


(‘ SI -Si) or . 


(10.286) 


The potential difference depends only on properties of the metal wires and the 
temperature difference between the junctions. For very small temperature 
differences, we can write 


d4>ad _ Sft 7T 

~dT~ F ~ T' 


(10.287) 


The quantity (d^d/dT) is the relative thermoelectric power of metal A against 
metal B. In order to obtain Eq. (10.287), we had to use Onsager’s relation, 
l se = fis- Therefore, the validity of Eq. (10.287) provides a test of Onsager’s 
relations. In Table 10.4, we compare measured values of (d^ad/dT) and ( 7 f/T) 
over a range of temperatures for two different thermocouples. We see that the 
agreement is quite good. 
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TABLE 10.4. Test of Equation, do^/dT = n/T [24, 25] 


Thermocouple 

T(K) 

7 r / fiV\ 

T \K ) 

d(j>ad ( 
dT\K 

Cu-Ni 

273 

18.6 

20.39 


302 

22.3 

21.7 


373 

24.4 

24.9 

Fe-Hg 

292 

16.7 

16.66 


373 

15.6 

15.42 


456 

13.9 

13.74 


^ S10.D.3. Thomson Heat 

A temperature gradient can be induced in a metal wire by placing opposite ends 
of the wire in contact with thermal reservoirs at different temperatures. If the 
wire is connected to a battery, an electric current, /, will flow in the wire. 
Thomson found that in the presence of the electric current, the original 
temperature gradient (in the absence of current) can only be maintained if heat, 
called Thomson heat , is absorbed (or emitted depending on the direction of 
current flow) along the wire. Thomson heat is distinct from Joule heat. 
Thomson showed that in order to maintain the original temperature gradient in a 
wire made of metal A, one must add or subtract an amount of heat, 
dQ = o A (T)IdT, for each incremental change in temperature, dT, along a wire 
of metal A. The quantity 


1 dQ 

<7 A — 

I dT 


(10.288) 


is called the Thomson heat. Thomson heat depends both on the type of metal 
and the magnitude and direction of the electric current, /. 


► S10.E. Entropy Production in Discontinuous Systems 

From Eq. (10.179), we see that entropy is produced when gradients in the 
temperature and in the chemical potential of constituent molecules induce 
spontaneous flows in a system. There are many cases in which these gradients 
are non-negligible only over very short distances. Such a situation occurs, for 
example, when a membrane separates two regions with different chemical 
compositions but which are well-stirred. Another case might be a boundary 
separating two regions maintained at different temperature by external 
reservoirs. For such cases it is useful to compute the rate of entropy production 
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Fig. 10.7. The spontaneous flow of heat or 
particles due to a gradient of temperature of 
chemical potential across a boundary or membrane 
leads to a production of entropy at the boundary. 


per unit area rather than per unit volume. Consider Fig. 10.7, which shows a 
membrane of width A z separating two regions with different temperature and 
chemical potential. We assume that a steady state is set up so that the flux of 
heat and of particles is constant. At the boundary, the chemical potential and 
temperature must be continuous. If they were not, then their gradients, which 
are the generalized forces driving the heat and particle fluxes, would be infinite 
at the discontinuity, and this cannot occur. Gradients in the temperature and 
chemical potential, however, need not be continuous. Inside the membrane, 
there will be a gradient in the temperature of order (AT / Az). There will also be 
a gradient in the chemical potential of contituent i of order (Am/Az). Then the 
entropy/area ■ time, produced by flows across the membrane, is given by 


°* = ' V ■ lo I©* - Jo 2 1 (?) * 

= J A (?) - £ ~ J A (f ) = - ? 

\ / j= l \ / j 


and 


(10.289) 


Tg a - -JsAT - ^2 


(10.290) 


where Js is the entropy current. The quantities AT and A fa are differences in 
temperature and chemical potential, respectively, across the membrane. 


► S10.E.1. Volume Flow Across a Membrane 

Let us consider the transport of a solution consisting of water and an uncharged 
solute (such as a sugar) across a membrane which is permeable to both water 
and solute. The mechanisms by which water molecules and solute molecules 
cross the membrane might be quite different. For example, the water molecules 
may be able to move through small pores in the membrane while the solute 
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Fig. 10.8. A membrane separates two compartments containing different concentrations 
of an aqueous solution of uncharged solutes. The compartments are held at different 
pressure, but have the same temperature. 


molecules have to diffuse through the substance the membrane is made of. We 
will consider a system containing two compartments separated by the 
membrane, and we will assume that the compartments are held at different 
pressures and have different concentrations of solute, but that the temperature is 
uniform through out the system (cf. Fig. 10.8). The chemical potentials of the 
water and solute will be different in the two compartments. This will cause a 
flow of water and solute across the membrane. The entropy per unit area 
produced by this flow can be written 


T(j / 1 — J s A pt r J w Ajl w — J s A[x s J w Afi w . (10.291) 

In this section we will work with molar densities rather than mass densities. To 
evaluate the entropy production, we need expressions for the chemical 
potentials of water and the solute. We shall assume the solute concentration 
is fairly low in both compartments. 

Differential changes in the chemical potential of the water and solute can be 
written 


dfii — 



dT + 



dP + 



dx s , 

T,P 


(10.292) 


where i = w,s and x s in the mole fraction of solute. Since we assume that the 
temperature is uniform throughout the system, we set dT = 0. Also 
{dHi/dP) TyXi = (5V/5n,) r/Jn ^ = v,-, where v,- is the partial molar volume of 
constituent i. Because water is highly incompressible, the partial volumes, v ( , 
can be taken as constant for small changes in pressure. The chemical potentials 
of the water and solute can then be written fi w = fj^(T) + v w P + /?rin(jtH,) and 
Us = p®{T) -t- v s P + /?7Tn(jts), respectively. We can use these expressions to 
compute the difference in chemical potential of the two components. 

For the water, the difference in chemical potential is 


A YV = #4' - 4, = MP" -P‘)+ ^ln (^) . 


(10.293) 



608 


HYDRODYNAMIC PROCESSES NEAR EQUILIBRIUM 


But ln(*H,) = — ln(l +n s /n w ) « -n s /n w — ~(n s /V)/(n w /V) = -v w c s . There- 
fore 


A /x w = v w [AP - RT Ac,] = v w [AP - An] , (10.294) 


where A^ = RT Ac, is the osmotic pressure difference that results from the 
difference in concentrations in the two compartments. 

For the solute, the difference in chemical potential is 


Afi, = IJ," — = V, (P" - P 1 ) + RT In (s) . ( 10.295) 


If we denote Ax, 

Ax, 

,=x II -x I 

Ax, 

lx 11 } ~ , 

Ax,l 

In In 

L4J 

1 1+ tJ 

= x'\ 

s 

'l+*5>+ 

. 2 x‘ + 


Ax* 


Ax, 1 / Ax,\ 

A 2U/7 


Ax, 

’-2Z + 


(10.296) 


where x, is the mean mole fraction of solute in the two compartments. If we 
now note that x, = n s / (n, + n w ) « n s /n w so Ax,/x, = A c,/c„ then 


Afi w 


* - _ Ac, . „ A7 t 

v,AP + RT — - = vvAP + — 

Co Co 


(10.297) 


is the approximate difference in solute chemical potential across the 
membrane. 

Let us again consider the entropy production. It can now be written 


To A = -7, A//, - J w Afi w = -J w v w (AP - An) - J s ^v,AP + — J 

= (TjV, + J w v w )AP - (7,/c, - J w v w ) An = -J V AP - J D An. 

(10.298) 


The current, J v = 7,v, + J w v w , is the total flux of fluid volume (the volume 
current) across the membrane. It has units (volume/area-time). The current 
J D = 7,/c, — J w v w is the volume exchange current. It can also be thought of 
as the relative velocity of solvent to solute, since Jd = v, — v w where 
v, = 7,/c, and v„, = J w v w have units of velocity. Jd is like a diffusion 
current. 
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We can now write the generalized Ohm’s laws for this system: 


Jy = — LppAP — 7T, (10.299) 

J D = -L dp AP - L dd Att. (10.300) 


Microscopic reversibility gives us Onsager’s relation, Lp D = L D p. 

We can explore the implications of these equations. If the concentrations on 
the two sides are the same so Ac s — 0, the osmotic pressure will be zero but the 
hydrostatic pressure imbalance still drives a volume of solution through the 
membrane. The volume current is [J^a^o = —LppAP and the coefficient Lpp 
is called the hydraulic conductance. Even if the osmotic pressure is zero, there 
can still be a diffusion current, [7 D ] A7r=0 = -L dp AP, through the membrane, 
driven by the hydrostatic pressure difference. This diffusion current arises 
because the mechanisms by which the solvent and solute can cross the 
membrane are different. This movement of solvent relative to solute is called 
ultrafiltration , and the coefficient L DP is a measure of the ultrafiltration 
capabilities of a given membrane. 

Now assume that the solute concentrations are nonzero and different in the 
two compartments, so the osmotic pressure difference, A7r, does not equal to 
zero but the hydrostatic pressure difference, A P, is zero. There will be a 
diffusion current, [Jd] ap=0 = -L DD An, where L DD is the diffusional mobility 
of the membrane. There will also be a volume flow, [^v]ap=o = —L pd Ait, 
called osmotic flow, and the coefficient Lpo, is a measure of this osmotic flow. 

Osmotic flow and ultrafiltration are related through Onsager’s relation, 
L PD = L dp . Both of them are measures of the permeability of the membrane to 
the solute. We can see this if we express the solute current, J s , in terms of A P 
and Am It is easy to show that 


Jv + Jd — Js 1 Vj + — 
c 


—{Lpp + Ldp)AP — {Lpo + Ldd)Att. (10.301) 


The condition that a membrane be semipermeable (i.e., totally block passage of 
solute molecules) is that J s = 0 for any value of A P and A7r. This, in turn, leads 
to the condition that for a semipermeable membrane, L PP = L D d = —L P d- We 
can now introduce a refection coefficient, o, for the membrane which is defined 
as a = —{L PD /Lpp). For a semipermeable membrane, a = 1 and solute 
molecules are totally “reflected” from the membrane (i.e., cannot pass 
through). All real membranes will have a reflection coefficient 0 < a < 1. The 
measured value of the reflection coefficient will depend both on the membrane 
and on the solute. For example, in aqueous solution, toad skin has values 
o = 0.89 for solute acetamide and o = 0.50 for solute methanol. The human 
red blood cell has values o = 0.62 for solute urea and o = 0.82 for solute 
malonamide. Dupont “wet gel” has values o = 0.0016 for solute urea and 
cr = 0.036 for solute sucrose. 
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)► S10.E.2. Ion Transport Across a Membrane 

Let us consider a dilute aqueous salt solution which is separated into two 
compartments by a membrane. We assume that there are equal concentrations 
of the salt molecules in the two compartments. The salt molecules dissociate 
into cations and anions. Assume that the membrane is permeable to water and to 
the ions. The two compartments are held at different pressure and an electric 
potential difference is induced across the membrane by inserting electrodes in 
the system (cf. Fig. 10.9). We assume that the temperature is uniform 
throughout. We can write the entropy production due to flow across the 
membrane as 


To A = -J c Apf c - J a Af/ a - J w Afi w , (10.302) 

where J c ,J a , and J w denote the fluxes of cations, anions, and water, respectively 
and Afi e c , A fi e a , and Afi w are their respective differences in electrochemical 
potential across the membrane. 

The electrochemical potentials of the anions and cations, respectively, can be 
written A/i e a = A /i a + z a F4> an d A/jf c = Afi c + z c F0, where A fi a and Afi c are 
the chemical potentials of the anions and cations. Therefore, the entropy 
production takes the form 


To a = -JcA/ic — J a A/j, a — J w Afi w — (J c z c F + J a Za^)A(f). (10.303) 

We can further simplify Eq. (10.303) if we note that the ion currents are related 
to the salt current, J s , since J a = v a J s and J c = v c J s , where u a and v c are the 
stoichiometric coefficients for the dissociation (cf. Exercise 10.8). The chemical 
potential of the salt, fi s , is related to the chemical potentials of the ions through 
the relation fi s = v a A fJL a + v c Afi c . The electric current across the membrane is 
given by / = Za¥J a + z c F7 c . In terms of these quantities, the entropy production 
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Fig. 10.9. An applied electric potential difference induces an electric current and a 
volume current across the membrane, and a pressure difference induces electmosmotic 
pressure between the two compartments, (b) An induced volume flow creates an electric 
streaming current in the circuit. 
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takes the form 


Tg a = -J s Ap s ~ J w Ap w - 1 A<f>, (10.304) 

and we have written it in a form that does not explicitly separate the anion and 
cation contributions. 

Let us now consider the difference in the chemical potentials of water and 
salt across the membrane. Since the concentrations are the same on the two 
sides and the temperatures are the same, the change in the chemical potential 
across the membrane is entirely due to the pressure difference and we can write 
A/Liy = v s AP and A p w = v w AP. The entropy production then can be written in 
the form 


Tg a = —JyAP - lA<f>, (10.305) 


where Jy = v s J s + v w J w is the volume flux of the solution. 

Let us now write the generalized Ohm’s laws for the system. They are 

Jy = —LppAP — LppAcj) (10.306) 

I = -L ep AP - L EE A(f). (10.307) 

Onsager’s argument of microscopic reversibility predicts that L PE = L EP . There 
is a fair amount of experimental data which allows us to check this prediction. 
Two measurements, one of the electroosmotic pressure and the other of the 
streaming current, will give us the information we need. 

If we induce an electric potential drop across the membrane, an electric 
current, /, and a volume current, Jy, occur across the membrane. These currents 
continue until a pressure difference is built up, and the system reaches a steady 
state in which the volume current, Jy equals zero. The volume current induced 
by the electric potential difference across the membrane is called electro- 
osmosis. The electroosmotic pressure, EOP, is defined as 

E 0 P -(bL~S 

(cf. Fig. 10.9a). 

Let us next consider the same fluid and membrane, but replace the battery by 
an ammeter to measure current flow for the case when the induced potential 
drop across the membrane is zero, A cf) — 0. We force fluid through the 
membrane with a piston. An electric current, /, is measured in the ammeter. We 
can define the streaming current, SC, as 

SC= (r) = 7^’ (10.309) 

\ J vJ A0=o L PP 


(cf. Fig. 10.9b). 
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TABLE 10.5. Measurements of the EOP and SC for Aqueous Solutions Using a 
clay Plug as the Membrane [24]. Two different experiments with the same 
concentration are shown. 


Percent 

Concentration 

-1/SC (cm 3 / 2 g 1 / 2 ) 

1/EOP (cm 3/2 g 1/2 ) 

-EOP/SC = L P e/L E p 

0.75% ZnS0 4 

0.386 

0.388 

0.99 

0.75% ZnS0 4 

0.3776 

0.377 

1.00 

1.0% CuS0 4 

0.392 

0.380 

1.03 

1.0% CuS0 4 

0.385 

0.389 

0.99 

1.0% CdS0 4 

0.532 

0.567 

0.94 

1.0% CdS0 4 

0.116 

0.115 

1.01 


The ratio of the EOP and the SC allow us the check the Onsager relation 
since (EOP/SC) = ( Lpe/Lep ). In Table 10.5 we give experimental values for 
this ratio. All the measurements were done using a clay plug as the membrane. 
Since one clay plug will differ from another, care must be taken to use the same 
clay plug in a given set of experiments. Also care must be take not to alter the 
properties of the clay plug in the course of the experiment. The data given in 
Table 10.5 show that Onsager’s relation is well-satisfied for this system. 


► S10.F. Stochastic Hydrodynamics [1] 

The hydrodynamic equations describe the average motion of conserved 
quantities in the fluid. They do not include detailed information about the 
other 10 23 degrees of freedom in the fluid. When we derived the hydrodynamic 
equations in Section (10.B), we used a generalization of Ohm’s law to obtain 
approximate phenomenological expressions for the dissipative heat current and 
the dissipative momentum current (stress tensor) in terms of spatial gradients of 
the temperature and velocity, respectively. We will now add a random 
contribution, g(r, t ), to the heat current and add a random contribution, S(r, t), 
to the stress tensor. In this way we can, to some extent, incorporate the effect oi 
the very rapid nonhydrodynamic fluctuations in the other 10 23 degrees of 
freedom. When these random currents are added, the entropy current takes the 
form 

J,(r,t) = -f V,r(r,/) g(r,r), (10.310) 

and the stress tensor takes the form 

n (r,/) = —2*7 [V r v(r, r )] 1 - CUV r ■ v(r, t) + S(r, t ). (10.31 1 ) 

The random heat current is a vector with components g,(r, 1), where i = x,y, z ■ 
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The random stress tensor is a second-order tensor with components ^(r, t). 
Below we will include these random currents into the hydrodynamic equations 
and use properties of equilibrium correlation functions to obtain expressions for 
correlation functions for the random currents. 


► S10.F.1. Stochastic Hydrodynamic Equations 


When the random heat current and the random stress tensor are incorporated 
into the hydrodynamic equations, the linearized hydrodynamic equations take 
the form 

dAp 

+ A)V r • v = 0, (10.312) 


and 


d\ 


fdP 


dt 


\dp 


Poin = - o- V r A P - — V r AT 


dP\' 


dTJ 


+ riV}y + K + r r? V r (V r • v) - V • S, (10.313) 


Po 


ds\° dAp / ds\° d AT 
dp) T dt Jrf) °\dT) p dt 


K_ 

To 


vjAr-i-v-g. 

'0 


(10.314) 


With the inclusion of random terms, the hydrodynamic equations become 
stochastic equations. As we will show below, we can use the properties of the 
equilibrium correlation functions, (Ap(r h ti)Ap(r 2 ,t 2 )), (v(ri,*i)v(r 2 ,f 2 )), 
and (Ar(ri,? ] )A7’(r 2 , ^ 2 )). to determine the correlation funtion, 
(g(ri,r 1 )g(r 2 ,r 2 )), for the heat current and (S(r 2 , )S(r 2 , / 2 )) for the stress 
tensor. 

It is useful to rewrite the hydrodynamic equations in terms of the response 
functions and the kinetic viscosities and thermal diffusivity given in Eqs. 
(10.61) and (10.62). If we also Fourier transform the space dependence and 
separate the equations into their longitudinal and transverse parts, we obtain 


gjw 

dt 


dpk(t) 

dt 


- ipokv{{t) = 0, 


= +ik—pt(t) + ik^-^T k {t) - vik 2 vl(t) + — sj[(/) } 


A>7 


7 


Po 


+ : » = - 7 ^r k (0 + 4- ft (0 

PoCpl dt dt poc p 




(10.315) 

(10.316) 

(10.317) 


(10.318) 
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and 


^ = +£*"«• (10.319) 

ot Po 

In above equations, we have broken the velocity vector, v k (r), into its 
longitudinal part, vj[(f), and its transverse parts, v^(?) and v£(t), where 

v*(/) = vj[(/)k + + v{l(t) t//. (10.320) 

The unit vectors, t/ and t//, are orthogonal to k and are orthogonal to each other. 
We have also broken the random momentum current into a longitudinal part, 
sj!(f), and transverse parts, $£(/) and S£(t), where 

k • S k (f) = fcs|(f)k + kSl(t) t, + kS{!(t)t n . ( 10 . 321 ) 

Note also that only the longitudinal component of the random heat current 
affects the hydrodynamics. 

The procedure for computing the random current correlation functions in 
volves two steps. First we compute the correlation functions for velocity 
fluctuations and temperature fluctuations for a fluid in equilibrium. Then we can 
use these to compute the random current correlation functions. In the remainder 
of this section, we shall consider fluids in which the thermal expansivity, a P , is 
so small that it can be neglected. That is, we shall set a P = 0 in Eqs. (10.315)- 
(10.319). From Eq. (10.269), we can write 7(7 — 1) = claj,To/c p , so we can 
also set 7 = 1. 


► S10.F.2. Properties of Equilibrium Correlation Functions 

Equilibrium correlation functions must satisfy certain conditions which 
then determine the form of the correlation functions for the random currents. 
Let us consider the correlation function, {A7’(r ] ,u)A7 , (r2,J2))> for the 
temperature fluctuations. From Section 5.B we know that it must be 
stationary. Therefore, (AT^i, 7)AT(r2, t 2 )) = (Ar(ri,u — f2)AT(r2,0)). 
Furthermore, for systems with very large volume, where we can neglect the 
boundaries (and if no spatially varying external fields are present), the 
correlation function depends only on the relative displacement, 17 — r 2 - Thus, 
we can write 

C7r(ri - 1*2,7 - t 2 ) = (AT( ri , 7)A7’(r 2 , r 2 )> (10.322) 

for a system in equilibrium. That is, the correlation function, 
(AT(ri, 7)Ar(r2, t 2 )), only depends on the differences, 17 — r 2 and t\ — 1 2 . 
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Let us now consider the Fourier transform 


(3ki {u\)t kl (u 2 )) = 


dt\ 


dt 2 


dri 


dr 2 e 


-iuj\t\ „-ioJ2h Jk, -n Jk 2 r 2 


x {Ar(ri,/i)Ar(r 2 ,? 2 )>, 


(10.323) 


where the spatial integral is taken over the volume, V, of the hydrodynamic 
system. We assume periodic boundary conditions. Let us make a change of 
coordinates, R = \ (ri + r 2 ), r = rj - r 2 , T = \ (ti + t 2 ), and r = t\ - t 2 . 
Then, 


(T kl (^i)Tk 2 (a; 2 )) 


f oo 


roc 


dT 


dr 


W, 


dre~ il<0Jx + UJ ^ T e -‘(^ -ui)t/2 


X e i(k 1 -k 2 ).r/2 e i(k 1+ k 2 ).R^ Ar ( ri ^ i ) Ar ^ 2> 


= 2nV6{\tL X + k 2 


)^(^i + ^2) | 


dTe~ iuJ ' T (T ki (r)T. kl (0)). 

(10.324) 


We must now define what we mean by these correlations functions. In 
Section 5.E, where we introduced the theory of Brownian motion, we had to 
introduce two different averages. The “random average,” ()^, was an average 
taken using the probability distribution of realizations of the random force. The 
“thermal average”, ( ) T , was a thermal average over possible values of the much 
more slowly varying conserved degree of freedom. We shall use these two types 
of averages to compute hydrodynamic correlation functions. The correlation, 
(T kl (r)T_ kl (0)), that appears in Eq. (10.324), is defined 


(r kl (r)r_ kl (0)> = ({r k| (r)) f r_ k| (0)) T . ( 10 . 325 ) 


We shall assume that the random currents have zero mean so that 


<Sy(r, /)) e = 0 and (g/(r, t ))^ = 0. 

In order to fully determine the correlation function, (7k, (u;i)7k 2 (u; 2 )), we 
must know the expressions for the stationary correlation functions 
(Pk(0)r- k (0)), (v“(0)r_ k (0)), and (|r k (0)| 2 ) [note that T_ k (0) = r k *(0)]. As 


we showed in Section 8.B for the magnetization correlation function, in the 
limit where k — > 0, these correlation functions can be obtained from 
Einstein fluctuation theory (Section 7.C). Einstein fluctuation theory, intro- 
duced in Section 7.C, allows us to determine thermal averages. It is valid in 
equilibrium and far from a critical point so that flutuations are small. Under 
these conditions, it tells us that temperature and density fluctuations are 
statistically independent of one another. We can also assume that velocity 
fluctuations are statistically independent of density and temperature flutuations. 
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Then we obtain 


. wi'toi 2 ), = gsHwiv = g . 

lim{ /Jk (0)r^(0)) T = lim<v“(0)r_ k (0)) 7 = lim<v«(0)v'_ k (0)) r = 0, 

k — >0 k — >0 k — >0 

(10.326) 

where c p is the specific heat. The temperature correlation function in Eq. 
(10.326) was obtained in Section 7.C, and the velocity correlation function is 
given by equipartition [cf. Sect. 5.E]. 

We can now obtain expressions for the time dependence of the temperature 
and velocity correlation functions. We first must find the time dependence of 
the averages, (Tk(t))^(vj[(/))^, and (vjjf (/))«. Let us return to Eqs. 

(10.3 15)— (10.319) and take the “random average,” (L, of those equations. 
Let us also set ap = 0 and 7 = 1 and take the Laplace transform of the 
time dependence, as we did in Section 10.C. For the case when ap = 0, we 
then find 


(U z)) £ = 


r k (Q) 

z + x* 2 ’ 


<*4w>{ = 

(VkW)e = 


zvj|(0) + ikjcl/po)^) 
z 2 + zvik 2 +clk 2 

v k(0) 


z + v t k 2 


and 


(v k (z)) f 


vg(0) 

z + Vtk 2 


(10.327) 

(10.328) 

(10.329) 


The time dependence is now easily found. For the temperature, we obtain 




6-Moo 


dz 


7k(0) 


s-ioo z + Xk 1 


= T k (0)e- xk2 '. 


(10.330) 


For the longitudinal velocity, we obtain 


(vj[(f)) f = 4 ( 0 ) At cos(c k kt) -~sin(c k kt)^ + R vp (t)p k (0), (10.331 , 


where A = ^f,c k = c 0 y / 1 - A 2 /(elk 2 ), and R vp (t ) is a time-dependent term 
that we won’t write here. Since limk- + o(vi[(0)/?_k(0)) r = 0, it does not 
contribute to the equilibrium velocity correlation function. We should note 
also that we have solved Eq. (10.331) exactly given equation (10.328). We have 
not just kept terms up to order, k 2 , in the denominator as was done in Eq. 
(10.73). It is necessary to keep the full k dependence of Eq. (10.331) in order to 
obtain the correct expression for the random current correlation functions. For 
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the transverse velocities we find 

< v k(0>e = v£(0)e~‘ / ' t! ' and <v''(f)) { = v"(0) e -^'. (10.332) 

If we now substitute Eq. (10.330) into Eq. (10.324), we obtain 

(f k ,(^)f k2 (^)) = 2vV6(k, +k 2 )%, + a>z)(|r t , (o)[ 2 ) r ■ 

(10.333) 

Similarly, we obtain the following expression for the correlation function for 
the longitudinal component of velocity: 

= 27rV<5(ki +k 2 )£(o; 1 + u; 2 )(|vj[ i (0)| 2 ) r 

x I " LQ\&\/(ck x k\) mAi/(c kl ki) j (10.334) 

[(wi - c kj ki ) 2 + Aj (u>\ - c kl ki ) 2 + A?J 

For the transverse velocity fluctuations we obtain 

(Vk,( w i)vk 2 (^)> = 27rV < ?(k 1 + k 2 )6(un + ^)(|v^ (0)[ 2 ) r ^ 4 ) > 

(10.335) 

with a similar expression for (v^ (a^v^u^)). Equations (10.333) and (10.334) 
apply to an equilibrium fluid in which thermal expansivity can be neglected so 
that a P = 0. With the information given in Eqs. (10.333)-(10.335), we can 
obtain the correlation functions for the random currents. 


► S10.F.3. Random Current Correlation Functions 

We now will obtain expressions for the random current correlation functions for 
the case when ap « 0. From Eqs. (10.316)-(10.319) we obtain the following 
expressions for the random currents: 


4M = k {^ Q u j[(u>), 

(10.336) 

£k M = P k 9 (u ixk 2 ) T k (u ) , 

(10.337) 

^k(w) iv t k 2 )v[.{u), 

(10.338) 
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S'V) =j( u ~ ^tk 2 )v^(uj). (10.339) 

From these equations we obtain the following correlation function: 

<s“>,)s“> 2 )> = {f U - L_ c M _ , •*,,**) 

kik 2 \ lo\ J\ u 2 V (10.340) 

x (vll, (^i)vl[ 2 (^ 2 )>, 


(gki (^l)gk 2 (^2)) 


2 ~2 

(10.341) 


and 


(*^k, ~ +i u tk\]{-u 2 + (^1)^2 (^2))* 

(10.342) 

The expression for (^((wi)^^)) is the same as Eq. (10.340), but with v 7 
replaced by v 11 

Let us now substitute Eqs. (10.333M10.335) into Eqs. (10.340)-(10.342). 
We then obtain 

('Sk,( a; 1 )^k 2 ( a, 2)) = 47rk B Tp 0 vi6(ki + k 2 )<5(u;i + u; 2 ), (10.343) 

(^k, (^i)S£ 2 (u; 2 )) = 4 nk B T p 0 v t 6(ki +k 2 )<5(wi + wj), (10.344) 

and 

<£k, (wi)fta(wi)) = 47rfc B T 2 A:<5(k 1 + k 2 )%! + u; 2 ). (10.345) 

Equations (1 0.343 )-( 10.345) are an expression of the fluctuation-dissipation 
theorem for fluids in equilibrium. We have written the correlation functions for 
random current fluctuations in terms of the transport coefficients which are 
measures of internal friction or dissipation in the fluid. 

We need to obtain expressions for correlation functions of arbitrary 
components of the random heat current g,(r, t), and of the random stress 
tensor, Sy(r, t). Let us first consider their Fourier transforms, &( k,u>) and 
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Sij(k,u), respectively. For an isotropic fluid, such as we are considering, the 
correlation functions (gi(ki,ui)gj(k 2 ,u 2 )) and (Sy (ki , u\ )S k [(k 2 , u 2 )) must be 
invariant under rotation of coordinates. This restricts the correlation function for 
the random stress tensor to depend on terms with tensor character, 6 y 6 u, 
8^1 + Su&jk* and 8 ik 8ji - SuS#. However, the random stress tensor cannot ontain 
an antisymmetric part for a fluid of point particles (which we are considering 
here). Therefore, the correlation function (Sij(k\ , uj\ )Su(k 2 , u 2 )) must have the 
form 


(5y(ki , u\ )5fc/(k 2 , u; 2 )) = 4ttIcbT p 0 [A(8 ik 8ji + 8u8 jk ) + B{8 ij 8 k i)\ 

x <5(ki + k2)<5(a;i + u; 2 ), 


where A and B are constants to be determined. Let us now consider the special 
case in which the wavevector, k, lies in the z direction. Then ti = x, $2 = y, and 
k = z, and we find 


(5 l 33(ki,a;i)5 , 33(k2,a;2)) — 

= 4nk B T pq(2A + fi)<5(ki +k2)<5(^i + 0 ^ 2 ) 


(10.347) 


and 


(53i(ki,o;i)53i(k2,a;2)) = {££, (^ 1 )^( 0 ^)) 

= AirliftT p$A 8 (k\ + k 2 )<5(o;i + u 2 ). 


(10.348) 


If we now compare Eqs. (10.347) and (10.348) with Eqs. (10.343) and (10.344), 
we find that A = v t and B = - 2 u t . Thus we finally obtain 


(5g(ki, wi)5 w (k 2 ,W2)) = 4nk B T[r)(8i k 8ji + 6 a 8 jk ) + (C - 5 v)(hAi)\ 

x (5(ki + k 2 )<5(o;i + u 2 ). (10.349) 


Similarly, in order to be invariant under rotation of coordinates, the random heat 
current correlation function, (g,- (k 1 , u>i )gj (k 2 , u; 2 ) ) , can only depend on Sy. 
Therefore, we find 

(gi(kuWi)gj(k 2 ,u 2 )) = 47rk B T 2 8ijK8(ki + k 2 )8(u\ + u 2 ). (10.350) 

Since thermal fluctuations are statistically independent of velocity fluctuations, 
we also have (g i (k 1 ,u>i)SH(k 2 ,u>i)) = 0. 

We can also find the space and time dependence of these correlation 
functions. If we Fourier transform them, we obtain 

(Sij{ruh)Ski{r 2 , t 2 )) = [vWfi + 8 a 8j k ) + (C - f »?)(M«)] 

x 6(ri - r 2 )6{ti - t 2 ) 


(10.351) 
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and 

(£i( r i 5 *i)£/( r 2 >* 2 )) = ~^-6ijK6(r { - r 2 )8(t l - t 2 ). (10.352) 

These equations give the random current correlation functions for an isotropic 
fluid of point particles in which the thermal expansivity, a P equals zero. Notice 
that they are “delta-correlated.’ That is, they have extremely short range. This is 
a reflection of the fact that these fluctuations occur on a much faster time scale 
and over much shorter spatial distances than do those of conserved quantities. 
For a discussion of the effect of noise on a broad class of systems see Ref. 26. 


► S10.G. Long-Time Tails [27-36] 

In Section 5.E we reviewed the traditional theory of Brownian motion in which 
a Brownian particle of radius R is assumed to move under the influence of a 
constant friction force (the Stokes friction, 67rr)R) and a random white noise, 
both due to the medium in which the Brownian particle moves. Under such 
conditions, the velocity autocorrelation function of the Brownian particle 
decays exponentially with time [cf. Eq. (5.84)]. However, in the 1960s, 
molecular dynamics computer simulations by Rahman [27] of liquid Argon 
showed significant deviations from exponential behavior in the velocity 
autocorrelations function of Argon atoms. His results are shown in Fig. 10.10. 

Alder and Wainwright [29] used computer simulations to compute the 
velocity autocorrelation of hard-sphere particles in a hard-sphere gas. They also 



Fig. 10.10. The velocity autocorrelation function for argon atoms in liquid argon. The 
dotted line is from numerical simulation of Rahman. The solid line is the exponential 
decay predicted by traditional Brownian motion. Reprinted, by permission, from R 
Zwanzig and M. Bixon, Phys. Rev. A2, 2005 (1970). 
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Fig. 10.11. The flow pattern around a hard-sphere particle in a hard-sphere gas. 
Reprinted, by permission, from B. J. Alder and T. E. Wainwright, Phys. Rev. Al, 18 
(1970). 


found deviations from exponential behavior. One of the more striking features 
they found was the long-time tail in the velocity autocorrelation function. In the 
long-time limit, the velocity autocorrelation function decayed as f“ 3 / 2 rather 
than exponentially with time. They observed vortices in the flow pattern around 
the Brownian particle (cf. Fig. 10.11). As the Brownian particle moved through 
the fluid, it left a memory of its motion which could influence its later 
movement. This complicated flow around the Brownian particle is a direct 
consequence of hydrodynamics. In the sections below, we shall whos how the 
long-time tails emerge as a natural consequence of hydrodynamics. 


► S10.G.1. Fluid Flow Around the Brownian Particle 

The linearized hydrodynamic equations were given in Eqs. (10.36)-( 10.38). If 
the Fourier transform their time dependence, we can write Eqs. (10.36) and 
(10.38) as 

- iup u {r) + PoVr • v w (r) = 0, (10.353) 

~ *n>P 0 Vu»(r) = ~V r /y r) - TjV r x (V r x v w (r)) + (C + |r/)V r (V r • v w (r)), 

(10.354) 

where we have used the vector identity, V 2 v = V r (V r • v) — V r x (V r x v). 

Let us now assume that a spherical Brownian particle of radius R moves 
through the fluid with velocity u(f) = Let us also assume that the surface 
of the Brownian particle is very rough so the fluid sticks to its surface. Then at 
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the surface the velocity of the fluid equals the velocity of the surface. To find the 
force exerted on the Brownian particle by the fluid, we must first find the 
velocity distribution of the fluid in the presence of the moving Brownian 
particle. With that solution, we can construct the pressure tensor, P (cf. Section 
10.B). Then the force on the Brownian particle is given By 


F = <MS-P = <MSr-P 


(10.355) 


Below we will solve the hydrodynamic equations for the velocity of the fluid in 
the presence of the Brownian particle subject to “stick” boundary conditions. 
We shall assume that the fluid has velocity v w (r) = at r = R [u^ is the 
Fourier transform of u (*)], and it has velocity v w (r, t) = 0 when r = oo. Also at 
r = oo, all derivatives of the velocity are zero. 

For simplicity, we will consider an incompressible fluid so that V r • v w = 0. 
The equation for the velocity flow reduces to 

-iupoV u (r) = -VrPu,(r) - rjV r x (V r x v w (r)). (10.356) 

The velocity, v w , can be written v w = V r x A w , where A w is a velocity “vector 
potential.” Let us note that is a polar vector. In order for the combination 
V r x A u to form a polar vector, A w must be an axial vector. There are two 
vectors in the problem from which we can build this axial vector. They are the 
radial vector, r, and the velocity, = uji. Let us write A w = (dg{r)/dr) rx 
u^, where dg{r)/dr is a function of the distance, r, from the center of the 
Brownian particle. It must be determined from boundary conditions. If we 
note that [dg{r)/dr\ r = V r g(r), then the velocity vector potential takes the 
form A u = (V r g(r)) xu w = Vx (g(r)u w ) and the fluid velocity can be 
written 


= Vr X Vr X (g(r)Ua,). (10.357) 

Let us now take the curl of Eq. (10.356). We find 

-iupoVr XV U = —77 V r xV r x (V r X V w ) = 7/Vj(V r x v w ) (10.358) 
where 

V r x v„(r) = -Vj(V r x g (r)u„) = 0, (10.359) 

Equation (10.358) then reduces to 

(k 2 V 2 r Vg(r) + V 4 r Vg(r)) x u„, (10.360) 

where k 2 — iuJpo/r) and k = ^iopoflr/ ( 1 + <). Thus, we must solve the 
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equation 


Vr[Vj g (r) + * 2 V^(r)] = 0, 


(10.361) 


subject to the boundary conditions of the problem. If we integrate Eq. (10.361) 
once, we obtain 

VU{r) + k 1 V 1 r g(r) = C ) (10.362) 


where C is an integration constant. Since all derivatives of the velocity must be 
zero at r = oo, we must have C = 0. We must find a solution to Eq. (10.362) 
such that Vjg(r) and all derivatives of Vjg(r) go to zero at r = oo. These 
conditions are satisfied by the solution 


V 2 *(r)=- — = 

r8{ ’ r 2 dr dr r 


dg ci Jkr 


where c\ is a constant. 

Let us now integrate Eq. (10.363) once to find 


dg = J_ 

dr r 2 



+ C 2 


From Eq. (10.357), the fluid velocity can be written 


„ / \ liiu cos (6) dg - U u sin (0) ( dg d 2 g\ 

v.W = -r - r Tr + 0— —\9 r + r -fri)- 


(10.363) 


(10.364) 


(10.365) 


The constants ci and ci in Eq. (10.364) can be found from the “stick” boundary 
condition at the surface of the Brownian particle, \ U (R, 6) = u^. This boundary 
condition, together with Eqs. (10.364) and (10.365), yields 

Cl= S e "'“ and C2= ? (,a366) 

We now are in a position to find the drag force on the Brownian particle due to 
its movement through the fluid. 


^ SI O.G.2. Drag Force on the Brownian Particle 

To find the drag force on the Brownian particle, we must perform the 
integration in Eq. (10.355). The first step is to write the pressure tensor. Eq. 
(10.356) can be written 


V r FL( r) = iupoY^r) - 7?V r x (V r x v w (r)). 


(10.367) 
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If we use the results from Eqs. (10.364)-(10.366), we find the following 
expression for the pressure 


//A (3R 3 ikR 2 k 2 R 3 \ /1n ,^ 

P = V u w cos (0) ~ 2 ^~J ■ ( 10 - 368 ) 


From Eq. (10.355), the drag force on the Brownian particle is given by 


F = R 2 sin (0)d0 
o 


2n 


d(f) (P rr r + P r $0 + P r<t>(p) 5 


where components P rr and P r e of the pressure tensor are given by 


d n O dv r , D (\ dv r , 8v 0 Vf) 

Prr = P-2 V - and P re = - V l-- + — -- 


(10.369) 


(10.370) 


Due to the geometry of the problem, there is no contribution from P r( p. On the 
surface of the Brownian particle we find that ( dv r /dr) r=R = 0, so 

//rt f 3 3 ik k 2 R\ „ . /m / 3 3 ik 

P rr = T)Uu COS (6) ( — — ) HIld Pr0 = Sln W ( 2 R~~2 

(10.371) 

on the surface. 

We can now substitute these results into Eq. (10.369) and integrate. 
However, before we do the integration we must expand the unit vectors r and 0 
in terms of Cartesian unit vectors. The directions of r and 0 vary over the 
surface while the Cartesian unit vectors do not. Note that r = sin (9) cos (0)x+ 
sin (6) sin (<f >) y + cos (6 ) z and 0 = cos ( 0 ) cos (4>)x + cos (9) sin(0)y- 

sin (9) z. If we perform the integration in Eq. (10.369), we obtain 


F — 6-KTjUu Rz, 1 — ikR 


k 2 R 2 


(10.372) 


In the frequency limit u> — > 0, the drag force reduces to the usual Stokes friction 
force, namely, F = (mrjuRz. The Stokes friction can only be used to describe 
Brownian particles which are moving very slowly. Below we will compute the 
velocity autocorrelation function for a Brownian particle which is bouncing 
around rather rapidly. Then frequency-dependent terms cannot be neglected. 


^ S10.G.3. Velocity Autocorrelation Function 

As we can see from Eq. (10.372), the friction force for a Brownian particle in a 
fluid depends on the frequency. This means that it has memory. The equation of 
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motion for the Brownian particle can be written 

m^+ f” a(t-t'W)dt' = F rand (t), (10.373) 


where a(t) is a memory-dependent friction coefficient whose Fourier transform 
is given by Eq. (10.372). Namely, 


a(u;) = 


r oo 

dta(i)e luit = 6n rjR 

— 00 


^1 - ikR - 



(10.374) 


The friction force is assumed to be causal so that a(t) = 0 for t < 0 and 
a(t ) ^ 0 for t > 0. In Eq. (10.373), F rand (t) is the random fluctuation force of 
the medium on the Brownian particle. We assume that the random force has 
zero average value, (F rand (t)) F = 0, where () F is an average over realizations of 
the random force. Because there is memory, the random force will not be 
“white.” 

If we make use of Eq. (10.374), the Fourier transform of Eq. (10.373) can be 
written 


—im'uu(uj) + Cm(u;) — (Ai\f—iuju(uj) — F rand (cj), (10.375) 

where A = y/(R 2 po/rj), = m + = 6nr)R, and M = f 7T/9o^ 3 is the 

mass of the displaced fluid. If we take the inverse Fourier transform of Eq. 
(10.375), we find 


, du(t ) N (A 

m _ + c„(r)+-^ J 


f 1 du 

ds — 

—oo 'v t s ds 


— Frand{t) ■ 


(10.376) 


Equation (10.376) is the Langevin equation for a Brownian particle of radius R 
in an incompressible fluid assuming “stick” boundary conditions. There are 
two interesting features. First, the particle has a memory-dependent friction 
coefficient as a result of the complicated flows around the particle. Second, and 
somewhat unexpectedly, there is a dressing of the mass of the Brownian 
particle. We will talk more about that below. 

Let us now construct the velocity autocorrelation function for the Brownian 
particle. We first average Eq. (10.376) over the random force and obtain 


m 


,d( u(t)) F 
dt 


CA 

+ C M*))f + ^ 


ds 


1 d(u{s)) l 


VT= 


ds 


= 0. (10.377) 


The velocity autocorrelation function is defined as 

C uu (t ) = (( u{t)) F u(0 )) T , 


(10.378) 
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where () T denotes a thermal average over the initial conditions. Therefore, in 
order to compute the velocity autocorrelation function, we must find ( u(t)) F for 
a given initial condition, w(0). This can be done by means of a Laplace 
transform. In Exercise 10.9, we take the Laplace transform of Eq. (10.377). We 
then find 


c„„(r) - 2m . 


6+ioo 


dze z 


5—IOO 


”* / (1 m (Q)1 2 )t’ 
m'z + C + 


(10.379) 


EXERCISE 10.9. Take the Laplace transform of the equation 


/ df(t) 


CA 


m^ r + </W + A 


Answer: Let us first note that 

• 5+ioo 


ds 


1 df(s) 


\Jt — s ds 


= 0. 


and 


r 


i PO+lOO 

f(0= dze zt f(z) and f(z) 

2m h-ioo 


dte- a ^Q = zf(z)-f( 0) and df ^ - 1 


dte zt f{t). (1) 


dt 


dt 27 xi 


6+ioo 


6—ioo 


dze zt zf{z ). 


(2) 


Also, we need to find the Laplace transform I(t) = 
ds (l/y/t — s) [df(s)/ds]. We can write 


J 1 OO P 00 P t 

dte~ zt I (t) = dte~ zt ds 
0 JO J— oo 


1 df(s) 


y/t ~ S ds 


i p 6+ioo oo 

- dz' 

Ji— /oo Jo 


2ni J 

Let us note that 


dte 


ds 


( 3 ) 


-z' ( t-s ) 


—oo 0 


zf(z). 


^ e -zf(t-s) _ f dx_ z > x _ /7F 

J-ooVT^ Jo ~VV’ 


(4) 
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where x = t — s. Equation (3) then takes the form 

1 r 6+i'co /*oo r~ 

7(Z) = 2^L/ Z 'i (5 > 

This expression for I(z) is well-behaved provided that Re(z) > Re(z'). Then 
we can perform the integration over time to obtain 


I{z) = 


__L f 


6+ioo 


2t ri J 5 _ IC 


d{ 


z — z 




( 6 ) 


Let us now consider the contours in Fig. 10.12. The integrand in Eq. (6) has 
a pole at z! = z which is to the right of the contour C\. Therefore, to perform 
the integration, we close the contour to the right and pick up the contribution 
from that pole. We then find 


I(z) = Vnzf(z). 

If we now combine the above results, we obtain 

m'/(0) 


7 


m'z + C + 


( 7 ) 


( 8 ) 


To obtain the final expression for the velocity autocorrelation function, we 
must compute the thermal average, (|«(0)| 2 ) r . First note that the mass, m ' , of 
the Brownian particle is now a “dressed mass.” In order to obtain the correct 



Fig. 10.12. Branch cut and integration contours. C i is the Bromwich contour. 
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diffusion coefficient, D, we must use the dressed mass, m! , when defining the 
thermal average. We write (|«(0)| 2 ) r = ( k B T/m '). The velocity autocorrelation 
function then becomes 


C uu (t) = 


k B T 

2m 


1 


■ 5+ioo 

dz e zt — — — — — . 

s-ioo m'z + C + C&VZ 


(10.380) 


The Laplace transform of the velocity autocorrelation function is given by 


C uu (z) = 


k B T 

m'z + c + 


(10.381) 


It has the property that lim z ^qC uu (z) = D = k B T/( *, where D is the Einstein 
diffusion coefficient. 

Let us now evaluate the integral, Eq. (10.380). The integrand has a branch 
point at z — 0. We draw the branch cut along the negative real axis as shown in 
Fig. (10.12). Then we integrate along the contour C 2 . The contribution from the 
small semicircle at the origin is zero because there is no pole at the origin. The 
correlation function is then found to be 


Cuu(t) = 


k B T 


7T 


f 


dxe 


-x\t\ 




(c -m'x) + C 2 AV 


(10.382) 


This integral must be evaluated numerically. However, some limiting cases can 
be obtained analytically. 

We can obtain an analytic expression for the long-time behavior of the 
velocity autocorrelation function. The long-time behaviour is determined by 
small values of x since contributions for large x are exponentially small. Thus 
we can obtain the long-time behavior by expanding the integrand in powers of 
y/x. We find [29-34] 

c^)=~ [°° dx\/x e - * 1 ' 1 + • • • = pf; - + ••-, (10.383) 

< Jo 2yfi\t \ 3/2 

where D = k B T/6nr]R is the Einstein diffusion coefficient. Notice that the 
velocity autocorrelation function for the Brownian particle does not die off 
exponentially, but has a long-time tail. It decays with time as r -3 / 2 . 

We can also compute the variance of the Brownian particle. Let us remember 
that x(r) — jc( 0) = Jo dt' v(t'). Then for a Brownian particle in equilibrium with 
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the surrounding fluid, we obtain 

(Ax 2 ^)) = ((*(/) -*(0)) 2 ) 


dt\ 


dt 2 ( u(ti)u(t 2 )) 



>2{t-T) 

r t/2 

[ 2T \ 

dT 

dr + 

dT 

dr j 

Jt/2 

2 (T-t) 

0 

-2T J 



• f 

't—r/2 

2 

dr 

dTC uu ( r) = 2 

. 

0 

r/2 


dr (t — r) C uu (r) . (1 0.384) 


In the fourth term we have made the change of variables, r = t 2 — h and 
T = i (ti + t 2 ). In the fifth term, we have changed the order of integration. For 
example, J dT J 2(f_:r) dr = J 0 ? dr J)'~ t/2 dT. For long times the variance 
takes the form 


<A x 2 (t)) = 2Dt-^t t 1/2 + 
v 7r 


(10.385) 


Thus, in the very long time limit it approaches the expected value. 

It is also useful to define a “time-dependent diffusion coefficient” given by 


1 d(Ax z (t)) 

D(t) = 2 dT 1 


dTC uu {r). 


(10.386) 


For long times, the time-dependent diffusion coefficient has the form 


. DA fn 


(10.387) 


The long-time tail in the velocity autocorrelation function for a Brownian 
particle with radius R = 1.7 microns in water has been measured in light 
scattering experiments by Paul and Pusey [36]. The light scattering experiments 
actually measure the variance, (Ax 2 (t)), because the correlation function for the 
intensity of the scattered light, (/(*)/( 0)), can be related directly to the variance 
through a relation of the form (7(t)/( 0)) «1+C exp ( -K 2 (Ax 2 (t ))), where C 
and K are constants which depend on the medium [36]. We show the results in 
Fig. 10.13. Curve A is the result for the case when the friction coefficient, 
a = 6nr)R, has no memory. Curve B is the prediction of Eq. (10.385). The black 
dots are the result of experiment. While the results of experiment qualitatively 
follow the theoretical predictions, they differ by a systematic quantitative 
amount. One possible source [4, 1982] for these deviations is the rotational 
degrees of freedom of water. The deviations between theory and experiment in 
Fig. 10.13 have been used to estimate the rotational transport coefficients in 
water. 
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Fig. 10.13. Measurements of the time-dependent diffusion coefficient, D{t) (in units of 
cm 2 sec -1 ), versus time (in seconds). Curve A is the predicted result using the constant 
Stokes friction coefficent, a = 6-irqR. Curve B is the prediction of theory if memory 
effects are included. The black dots are measurements for a Brownian particle in water. 
Reprinted, by permission, from G. L. Paul and P. N. Pusey, J. Phys. A: Math. Gen. 14, 
3301 (1981). 

As we mentioned at the beginning of Section S10.G, the deviations from 
exponential behaviour observed by Rahman [27] for the velocity autocorrela- 
tion function of argon atoms moving in argon have been reproduced 
theoretically by Zwanzig and Bixon [28] using the hydrodynamic equations 
to compute the time evolution of the velocity autocorrelation function, 
assuming that the Brownian particle is an argon atom. For argon atoms in 
argon, there will be very high-frequency components to the Brownian motion. 
Zwanzig and Bixon therefore included effects due to the compressibility of the 
fluid. They also included viscoelastic effects. For high-frequency motion, the 
liquid begins to take on some of the rigidity of a solid. Some of this rigidity can 
be accounted for by including frequency-dependent contributions to the 
viscosity. The calculation of the memory terms in the friction coefficient then 
proceeds in the same manner as described above. Zwanzig and Bixon, however, 
used slightly different boundary conditions. For Brownian particles as small as 
atoms, they used “slip” boundary conditions, rather than the “stick” boundary 
conditions that we used above. Slip boundary conditions mean that no torques 
can act on the Brownian particle. Also, for the case of a hard sphere, the normal 
component of the velocity of the fluid at the surface of the sphere is equal to the 
normal component of velocity of the sphere (no fluid can flow into or out of the 
sphere). A comparison of the results of the Zwanzig and Bixon with Rahman’s 
numerical experiments is shown in Fig. 10.14. The qualitative agreement is 
quite impressive. This shows that even on the molecular scale, the conservation 
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Fig. 10.14. The velocity autocorrelation function normalized to unity at the initial time. 
The dotted curve is Rahman’s data. The solid curve is theory. Reprinted, by permission, 
from R. Zwanzig and M. Bixon, Phys. Rev. A2, 2005 (1970). 

laws which give rise to hydrodynamic behavior play a fundamental role. A 
systematic study of the effects of compressibility and viscoelasticity on 
rotational and translational Brownian motion is given in Ref. 37. The effect of 
hydrodynamic memory on the dielectric response of a Brownian particle in a 
nonpolar fluid was studied in Ref. 38. 


► S10.H Superfluid Hydrodynamics [1, 39-43] 

In this section we shall derive the hydrodynamic equations describing He(II) — 
that is, liquid He 4 below the A line (cf. Sections 3.D and 3.F). We assume that 
He(II) is composed of two interpenetrating fluids: (1) a normal fluid which 
behaves somewhat like a classical fluid and has velocity v„ in the laboratory 
frame and (2) a superfluid which carries no entropy and has velocity \ s in the 
laboratory frame. We shall first obtain the hydrodynamic equations for the 
system and then obtain dispersion relations for first and second sound in the 
superfluid. 


► S10.H1. Superfluid Hydrodynamic Equations 

It is useful to first write the balance equations for He(II). The mass balance 
(mass conservation) equation is given by 


| + V.J = °, 


(10.388) 
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where p is the total mass density and J is the total momentum density of the 
fluid. The total mass density can be written p = p n + p s , where p n is the mass 
density of the normal fluid and p s is the mass density of the superfluid. The 
momentum density of the fluid can be written J = p n \ n + p s \ s , where v n is the 
normal fluid velocity and \ s is the superfluid velocity. It is important to note that 
the superfluid and normal fluid phases cannot be separated in real space. The 
superfluid phase results from the condensation of part of the substance into a 
single quantum state. 

The momentum balance equation is 

U + vp + v • n* + v • n D = o, (io.389) 

where P is the hydrostatic pressure, A* is the reversible stress tensor 
(momentum current), and A D is the dissipative stress tensor. 

The energy balance equation is 

OT T 

-fa + V • (Jf + J?) = 0, (10.390) 

where and are the reversible and dissipative energy currents. 

The entropy balance equation is 

^ + V-(pjv„ + j£) = ( 7, (10.391) 

where 5 is the specific entropy, ps\ n is the reversible entropy current, jJ is the 
dissipative entropy current, and a is the entropy production. The reversible 
entropy current only depends on the normal fluid velocity, v„, because the 
superfluid does not carry entropy. 

In Section S3.B we found that the superfluid velocity is driven by gradients 
in the chemical potential. Thus we write 

— = ^ + (v, • V)v, = -V(/ + pP), (10.392) 

where d\ s /dt is the convective time derivative of the superfluid, and p R and p D 
are reversible and dissipative contributions to the chemical potential. 

It is useful to consider the fluid in the rest frame of the superfluid. Let Eo 
denote the energy in the superfluid rest frame. Then differential changes in the 
energy are given by 

dEo = pdp + Td(ps) + wdJo , (10.393) 

where Jo = J — py s is the momentum density of the fluid in the superfluid rest 
frame and J is the momentum density in the laboratory frame of reference. The 
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chemical potential is p = (dE 0 /dp) psJo . The temperature is T = (dE 0 /dps) fl Jq . 
The new quantity, w, is a velocity which is defined w = (dE 0 / d Jo) pps . 

The total energy, E, in the laboratory frame is related to the energy, Eo, in the 
rest frame of the fluid by a Galilean transformation 

E = Eq + v s • Jo + \p^l = Eq + \ s • (J — p\ s ) + \pv 2 s - (10.394) 

In the discussion below, we shall consider the fluid to be a function of the 
variables, p,ps, J, and v*. From Eqs. (10.393) and (10.394), a differential 
change in energy, E, can be written 

dE = (p - w • \ s - \v])dp + Td(ps) + (w + \ s ) ■ dj + (J - p(w + v,)) • d\ s . 

(10.395) 

It is found experimentally that if the velocities of the fluid are too great the 
superfluid phase can be destroyed. Therefore the superfluid equations are only 
useful for low velocities and we can neglect convective contributions to the 
hydrodynamic equations. As a first step we will reexpress Eq. (10.395) in terms 
of partial time derivatives rather than total time derivatives since total time 
derivatives contain contributions due to convection. Thus we write 


dE 

dt 


dp 

dt 


~=(/i-w-v i -iv i 2 )^+r^ + (w + v i )-^+(J-^(w + v,)) 


dj 

dt 


d\ s 
dt 
(10.396) 


As a next step we will substitute the balance equations, (10.388)-( 10.392), into 
Eq. (10.396) and eliminate terms which are cubic in the velocity. This gives 

= - Vr ‘ (J* + J?) + ^ Vr ' J + ( w + Vs) • VrP + (w + \ s ) • (V r • fl D ) 


+ (J - p(y + w)) • Vr (p R + P D ). 


(10.397) 


The reversible stress tensor does not appear in Eq. (10.397) because it is a 
momentum current which is explicitly quadratic in the velocity and therefore 
leads to contributions in Eq. (10.397) which are cubic in the velocity. 

Let us now write the Gibbs-Duhem equation in the superfluid rest frame, 


dP = pdp + psdT + Jo • dvr. (10.398) 

If we substitute the Gibbs-Duhem equation (10.398) into Eq. (10.397) and 
neglect terms cubic in the velocity, we obtain 

r lT = - Vr • [••§ + Jf - - (w + v *) ■ n°] + (J - p(y s + w) • v r (// - p) 

+ (J - p(y s + w)) • Vr/i D + ps{\ s + w) • V r r - n D : V r (Vj + w). 

(10.399) 
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There is one final step necessary to put the entropy balance equation into the 
form Eq. (10.391). If we divide Eq. (10.399) by T and rearrange terms, we find 

dps „ r (j§ + jg-/tj-(w + v,)-n D -/i D x)' 
dt ' ' [ t 

+ [J* + - AJ - ( w + v s ) ■ n D - psT(v/ + V s ) - pPk] • V r (ij 

+ ■ V r (M* - A) - Y v r • A - in° : V r (w + v,). (10.400) 


To obtain Eq. (10.400), we have used the fact that 

i(J-p(v s + w)) V rA1 D = V r - (^j -^V,A- 

(10.401) 


where X = J — p(v s + w). Equation (10.400) can be simplified further if we 
first consider the case of a reversible fluid. The fact that the entropy production 
must be zero for a reversible fluid allows us to give a more precise definition of 
the reversible currents. 

In a reversible fluid, all the dissipative contributions in Eq. (10.400) are zero. 
The equation for a reversible fluid is then 


dps 

dt 


-v r 


v*e - Ajy 

T 


+ [j£ - pJ- psT( w + v,)] 



+ — X • V r (/ — p). 


(10.402) 


A reversible fluid, by definition, has no entropy production. This, in turn, means 
that the reversible energy current must be given by 

j£ = pJ + psT(yv + \ s ), (10.403) 

and the reversible entropy current is given by 

J? = i(j|-AJ) = P*(w + v s ). (10.404) 

However, the superfluid can’t carry entropy so we must have jf = ps\ n , and 
this in turn means that 


w = v„ - V,. 


(10.405) 


Also, p R = p, so the reversible part of the chemical potential is just the 



SPECIAL TOPICS: SUPERFLUID HYDRODYNAMICS 


635 


thermodynamic chemical potential. The quantity pP is a contribution to the 
chemical potential due to dissipative processes. We can write the equation for 
the dissipative fluid as 

^ = -V r • (psv„ + J°) - Ij° . V r 7- - ^ V r • >. - in D : V r v s , (10.406) 

where - v„ • f\ D — p D k and k = J — pv„. 

Let us now write the generalized Ohm’s laws for superfluid. There are two 
scalar forces, namely, V r • k and V r • v„. There is one vector force, V r T, and 
one symmetric tensor force, (V r v„)\ We therefore can write 


n D = -CiVr-A-c,v r -v„, 

(10.407) 

M — — C3^r ' A C4^r * V„, 

(10.408) 

J? = -fv r r, 

(10.409) 

(n°r = — »;(v r v„) s . 

(10.410) 


The superfluid has six transport coefficients. From Onsager’s relations we have 

Ci = Ca 


► S10.H.2. Sound Modes 


In this section we will obtain dispersion relations for the various types of sound 
modes in the absence of dissipation. To obtain sound modes in the 
nondissipative case, we set all dissipative currents to zero in Eqs. (10.388), 
(10.389), (10.391), and (10.392) and we linearize about absolute equilibrium. 
We shall write p = po 4- A p, s = so + As, P = Po + A P, where po, so, and P 0 
denote the equilibrium density, entropy, and pressure, respectively, and 
Ap = Ap(r, t), As — A$(r, t), and AP = AP(r, t) denote their deviations 
from equilibrium. We also neglect all terms to second order in the velocities, 
v„ and \ s . The hydrodynamic equations then take the form 


dt P + py r • v « + ^° V r • V, = 0, 

(10.411) 

^ + ^ + V,AP = 0, 

(10.412) 


(10.413) 
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5 + V r A M = 0. 


(10.414) 


Let us now choose the density and temperature to be our independent 
thermodynamic variables. Equations (10.41 1)— (10.414) then become 


+ Pn^r ■ V M + /?°V r • V, = 0, 

J> , 1 „ , ( dP 


^^ + ^ A ^{af)^ AT = ^ 


o 1 (dP\ dAp p°c p dAT 00 


(10.415) 

(10.416) 


s -7{a-T) 0 ^+^V + ' Vv - v ” = 0 ’ ( 10 - 417 ) 


dv s 1 0 1 (dP 


VAT = 0. 


(10.418) 


In Eqs. (10.415)— (10.41 8) we have used the fact that the isothermal 
compressibility is defined as k t = (1/ p)(dp/dP) T and the specific heat is 
defined c p — T(ds/dT) p . Also s — -( dfL/dT) p and 1/p = (dp,/dP) T , so that 
(■ dji/dT ) = ( dpjdT) p + ( dp,/dP) T {dP jdT ) and ( dp/dp) T = (dfi/dP) T 
{dP/dp) T = 1 /(p 2 k t ). 

Equations (10.41 1)— (10.414) enable us to obtain the dispersion relations for 
the various types of sound that can exist in an ideal superfluid system. Let us now 
Laplace transform the time-dependent part of the above equations and Fourier 
transform the space-dependent part. It we note that all contributions from the 
normal and superfluid velocities are longitudinal, we can write the Fourier- 
Laplace transform of Eqs. (10.41 1)— (10.414) in the following matrix form: 


Z 5°- 


i /ap\° 

7\&r) p 


~ikPn -ikp° s 
—ikp°s° 0 


k 

(p°) 2 K T 

1 


\ 0 

P 0 \dTj o \ 


J dP 




P° \dT, 
0 0 


p(k,z) 

Vn(k, z) 

f(M 

v s (k,z) 


pS 0 
0 0 


Ap(k,0) 
v„(k, 0) 
Ar(k,0) 
v,(k,0) 


(10.419) 
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The determinant of the 4 x 4 matrix, 


s(M = 


1 /dP\° 


p° \dT 


{p 0 ) 2 *t 


.,(dP 

~ ,k W 


ik J — r 


1 fdP 


P° \dT 


-ikp° n -ikp° s 


-ikp°s c ‘ 


(10.420) 


gives the dispersion relation for sound modes in the fluid. The sound 
frequencies occur at values of z which satisfy the equation 


rwWk m Afof 4 | f 1 | | T ° \(9P 

Det(S(k,z)] T0 |z+( p o K7 .+ £pp o + c () (p0) 2 [(ar 

+ ^)=0. 

P°Pn C pKT 


(10.421) 


We can simplify the dispersion relation somewhat because the thermal 
expansivity, a P , for He(II) is very small, at least for temperatures well below the 
transition temperature. This can also be seen in Fig. 3.20. For low enough 
temperature, the slope, (< dP/dT) p is approximately zero. Thus, to good 
approximation we can set 


1 f dp\ 

Oip = — ) — kt 

P \ dT J p 


(10.422) 


The dispersion relation then simplifies to 


Det[S(k,z)] « - 


p°p %\. 4 J i , rV)^. 2i , rWfu 


z*+ — + 
V At 


*V + o' d " ^ 

P°P° n CpKT 


r + — * 

P u k t 




k 2 = 0. (10.423) 

Pn c P / 


We see from Eq. (10.423) that there are two different types of propagating 
sound modes in He(II). They are called first sound and second sound , 
respectively. 

First sound has a dispersion relation, z = ±ic\k, and propagates with speed 


ci = A/Ar- 


(10.424) 
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First sound consists of density waves in He(II), similar to sound modes in 
classical fluids, with one difference. The speed of first sound depends on the 
isothermal compressibility, while the speed of ordinary sound in classical fluids 
depends on the adiabatic compressibility. Note that Eq. (10.424) assumes that 
(dP/dT)^ 0. 

Second sound has a dispersion relation z — ±ic 2 k and propagates with a 
speed 


C2 



(10.425) 


Second sound consists of temperature waves which can propagate across the 
fluid. It was first measured by Peshkov [43], who used an oscillating heat source 
to set up standing waves in a tube. A plot of the speed of second sound 
as a function of temperature is given in Fig. 10.15. We see that the velocity 
first peaks at 20.36 m/sec at T = 1.65 K and decreases slightly as we lower 
the temperature, but then increases again and reaches a limiting value of about 
150 m/sec at T = 0 K. 

Measurements of the speed of second sound enable us to obtain a value of 
the ratio p„/p Q . If we remember that p = p° n + pj, then Eq. (10.425) takes the 
form 


T 0 (s°f 

p° c p c\ + r°(s°) 2 ' 


(10.426) 


From Eq. (10.426), estimates of the hydrodynamic density of the normal fluid 
can be obtained. Another form for sound occurs when He(II) flows through a 
porous material. It is called fourth sound. Derivation of the dispersion relation 
of fourth sound is left as a homework problem. 


C2 (m/sec) 



Fig. 10.15. Speed of second sound. 
(Based on Ref. 44.) 
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In Eq. (10.414) we have postulated the existence of an additional 
hydrodynamic equation for He(II). Its origin is not the microscopic 
conservation laws, but broken gauge symmetry in the fluid. We shall consider 
the hydrodynamics of symmetry broken systems in the next section and obtain a 
deeper understanding of the origin of Eq. (10.414). 


SI 0.1. General Definition of Hydrodynamic Modes [45] 

The most commonly observed quantities in physical systems are the correlation 
functions or their Fourier-Laplace transforms, the spectral density functions. 
The behavior of these quantities gives essential information about the dominant 
underlying dynamics of a system. In many-body systems, as we shall show 
below, when correlation functions show hydrodynamic behavior, it can 
orginate from two different sources, either conserved quantities or broken 
symmetries. 

Let us consider the microscopic densities, a,(r), which may represent 
the number density, momentum density, spin density, or any other density we 
might be interested in. The equilibrium dynamic correlation function may be 
written 


Cy(ri,ri;r 2 ,r 2 ) = Tr[* 0H di(r u ti)dj(r 2 ,t 2 )\. (10.427) 

According to Eq. (6.53), the microscopic density evolves in time as 
ai(r u ti) = e iLti ai(ri), where L is the Liouville operator. Because we are 
looking at an equilibrium correlation function, the correlation function depends 
only on the time difference, t\ — t 2 . This is easy to see. The trace of a product of 
operators is invariant under cyclic permutation of the operators. Also, the 
Liouville operator, L, commutes with the Hamiltonian, H. Therefore the 
dynamic correlation function can be written in the form 

C„(: r,,r 2 : f, - t 2 ) = Tr[ e -^riK' I( ' ,_ ' j) %(r 2 )]. (10.428) 

Let us now introduce the Fourier-Laplace transform of the correlation function, 

»Sy(ki,k 2 ;z) = | dv x e + ^ vTl J dr 2 e +lli2r2 J d(t\ - t 2 )e~ z{h ~ t2] Cij(r x , r 2 ; t x - t 2 ). 

(10.429) 

In the absence of any spatially varying external fields, the correlation function 
can only depend on the relative displacement, r = ri — r 2 . Therefore, the 
spectral density function can be written 


S tf (k,,k 2 ;z) = V«(k, + k 2 )S, y (k 2 ,z), 


(10.430) 
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where 

Sij{k,z) = Trje -/W a,-(-k)— > (10.431) 


(cf. Section S10.F). In the discussion below, it will be useful to think of the 
spectral density matrix, Sy(k, z), as a matrix elements with respect to the states, 
|a,(k)). Then we can write it 


fy(M 



1 


z + iL 


“A k)\sTr 


* p «*( k )— — 7%( k ) 
z + iL 


(10.432) 


where a,(— k) = a*( k). We shall limit ourselves to small amplitude fluctuations 
(not too close to a phase transition) so that coupling between modes of different 
wavelength can be neglected. Also, we shall choose our modes so they are 
statistically independent (cf. Section 7.C). Then 


(a.OOKOO) = y k k(«.( k )l«.( k ))- (10.433) 

As we saw in Section 10.C, the dynamics is determined by the spectrum of the 
spectral density matrix in the complex z plane. In order to determine the 
behavior of this spectrum, it is useful to introduce projection operators. 


► SI 0.1.1. Projection Operators [45] 


Projection operators allow us to decompose the system into one space which 
contains the hydrodynamic densities and another space orthogonal to it. They 
also allow us to generalize the concept of a hydrodynamic mode to include 
modes which arise not only from conserved quantities, but from broken 
symmetries as well. Projection operators provide a very powerful means of 
analyzing systems. They were first used in statistical mechanics by Zwanzig 
[46], although the idea of projecting onto the macroscopic modes of a system 
was first used by Mori [47]. Projection operators have also played an important 
role in the microscopic theory of irreversible processes developed by Prigogine 
et al. [48]. They can be used to study stochastic processes [49]. In this section 
we shall only be interested in projectors onto the macroscopic modes of the 
system. Our discussion follows closely the presentation of Forster [45]. 

We introduce the projection operator; 


Pk 


£k( k )> 


1 

(«.(k)Kk)> 


(««-( k )l> 


(10.434) 


and an orthogonal projection operator, Q k , such that P k + Q k = /, P k and Qx 
are orthogonal projection operators because they satisfy the conditions 
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P\ = Pk, Qi = Qk, and P k Q k = Q k P k = 0. Q k removes all modes, {a t (k}, 
from any object that it acts on. The orthogonal space defined by Q k acts as a 
bath or reservoir that influences the behavior of the collection of modes, 

R(k)}- 

We will now use the projection operators to rewrite the spectral density 
matrix in a more revealing form. Let us first note that 


1 


1 


1 


1 ? a 1 

iLP k 


z + iL z + iLP k + iLQ k z + iLQ k z + iLQ k z + iL 


and 


1 


1 


z + iLQ k z 


1 - 


iLQ k 


z + iLQ k 


Then 


1 


— iLP k j + iLQ k 


iLP k 


1 


Z + iL z + iLQ k z Z + iL z Z + iLQ k z + iL 


(10.435) 


The spectral density matrix can then be written 
Sij(k,z) = (a t ( k) | -J-x | aj(k)^ 

= -(«/(k)> --^<a f ( k )|iL|a/(k» 

7 7 * * 


(«, ( k)Rk))( a ' (k) |FTlz|^ (k) ) ( ia436 > 


+ - V / fl/(k) j iLQ k iL ] a/(k) 

z i\ I z + iLQ k | 


1 


(a,(k)|a,(k)) 


ai( k) 


1 


z + iL 


«/(k) 


Equation (S10.I.10) can be simplified. Let us remember that dai(k)/dt = 
iLa,(k) so d\cii(k))/dt = iL|a,(k)). We can denote d\ai(k))/dt = |a,(k)) = 
iL|a,-(k)). Now define a frequency 

LJij = (ai(k)\L\aj(k)). (10.437) 

Then 

<a*(k)|o,(k)) = -<a f (k)K(k)) = (10.438) 
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We also introduce the so-called memory function, Afy(k,z), which is defined 
as 


Mij(k,z) = (ai(k)\iLQ k iL\aj(k)) 

z + iLQ k 

= -(g,-(k)|Qk — V^ -fe(k))- 

z + iLQk 

Then the equation for the spectral density matrix takes the form 
zl • S(k, z) + ifi(k) • G _l (k) • S(k,z) + M(k,z) • G _1 (k) • S(k, 


(10.439) 


z) = G(k), 
(10.440) 


where Gy(k) = (a,(k)|a / (k))^y. For the case of two modes, Eq. (10.440) takes 
the form 


( z 0 > \ + Y Wl1 Uu \f G n 0 \ 

\ 0 Z ) V ^21 ^22 / \ 0 G 22 j 

(M n M 12 \(Gu 1 0 \l/5„ S l2 \ = ( G u 0\ 

Vm 21 M 22 )\ 0 G^JjV^i S 22 ) 1^0 G 22 )' 

(10.441) 

Let us now look more carefully at the structure of this equation. We first 
consider the case of conserved densities and then consider the case of broken 
symmetries. 


► SI 0.1.2. Conserved Quantities 

For simplicity, let us consider a system with two hydrodynamic modes, whose 
microscopic operators are given by d\ (r) and a 2 (r). We consider the case when 
(d\\a 2 ) = {a\\a 2 ) = 0 so there are no propagating (sound) modes. From the 
linearized hydrodynamic equations, the average values of these quantities 
evolve according to the phenomenological equations 

d fcijiv)) = miV 2 (ai ( r>f )) + )?12 V 2 <a 2 (r,»)> (10.442) 


and 


= %1 V 2 (fli(r, t)) + 7722 V 2 (a 2 (r, t)), 


(10.443) 
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where rfrj are transport coefficients. From these equations we can write the 
equations of motion for the correlation functions (cf. Ex. 10.4) 

d_( C\\(r,t) Ci 2 (r,t)\ = f t?h t? 12 \ y2 / Cn(r,t) Ci 2 (r,f)\ 
dt \ C 2 i(r, t) C 22 (r, t) J \rf 2i r\n ) r ^C 2 i(r,f) C 22 (r,f)y’ 

(10.444) 

where Q(ri - r 2 , t\ — t 2 ) = Tv[e~ dH a\(ri,t)a 2 (r 2 , t 2 )]. If we take the Fourier- 
Laplace transform of Eq. (10.444), the equation for the spectral density matrix 
takes the form 


\(z S 12 (k,z)\ = /G„(k) 0 \ 

[V° V V»fti ^22 yj ^5 21 (k,z) S a (k,z)J ^ 0 Ga(k)/ 

(10.445) 

Comparison with Eq. (10.441) yields the following identification: 

Vij = limlim/iMy^zjGj: 1 ^)}. (10.446) 


Thus the transport coefficients are given by a specific limit of the memory 
function. 

It is useful to examine the memory function, Afy(k,z), a little more closely. 
Following a derivation similar to that used to obtain Eq. (10.445), we can write 


1 1 U 1 11 

— — — — FkiL — H ~ — — 

z + iL z + iQkL z z + iL Zz + iQkL 


QkiLPkiL 


1 


Z + iL 


(10.447) 


We then can write 



1 

z 4 - iL 




_UL\ 

z + iQkL I / 



1 

z + iQkL 



1 

z + iL 



(10.448) 


where we have used the fact that (a/|Pk = 0. We can rewrite Eq. (10.448) in 
matrix form to obtain 




z+iL 1 


( 1 0\ 1 (MuG- n l 

\0 1 / z yM 2 iGf/ 


MnG^Y 1 (Mu 

M 22 G^ 2 J \Af 21 


M n \ 

m 22 )' 


(10.449) 
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In Section 6.D.3 we found that if the quantity, a,(r,t) is conserved 
microscopically, then it satisfies a microscopic balance equation of the form 


dhi 

~dt 


V r ■ Ja,(r> f)- 


(10.450) 


Therefore, |a,(k)) = i\k • J a .(k)) and My is of order k 1 . Therefore, to lowest 
order in k 2 , we can write My « (a,|l/(z + iL)|a/). The transport coefficient, 7/y, 
takes the form 


•» = - (* ' J ' | ITU | ' J ') G " 1(k) } 

= - lim lim j dt j (k • J,-|«"^ + ^|k • J ))Gjj 1 (k) J , 


(10.451) 


where k is the unit vector in the direction of k. In Eq. (10.451), we have written 
the transport coefficient, r?y, in terms of a current correlation function. This is a 
form commonly seen in the literature. 


^ SI 0.1.3. Hydrodynamic Modes Due to Broken Symmetry [45] 


In this section we wish to find the origin of hydrodynamic modes which arise 
due to broken symmetries. However, it is useful first to consider how the 
spectral density function for a quantity which is not conserved on the 
microscopic level behaves. Let us consider a microscopic quantity, B, which has 
a density, 6(k, r), which is not conserved during collisions and does not 
commute with the Hamiltonian. One such quantity might be the orientation of 
the molecules in the system. It will satisfy a microscopic balance equation of 
the form 

^^ + ik-j*(k,f) = o-*(k,0, (10.452) 

where <T*,(k,t) is an operator which acts as a source of the quantity B and 
satisfies the condition limA ; _ > o^(k, t) = ab(t) = J drab(r,t ) ^ 0, Thus, 
(d/dt) J drb(r, t) = a b{t) ^ 0. For simplicity, let us assume (/>(k)|fc(k)) = 0. 
Then 

lim(6(k)|0k Kj-Q k \b(k)) = {(J b \Qo \f~r QoWb) 

0 z + iLQ k z + iLQo 


and the memory function does not go to zero as k — ► 0. Thus, to lowest order in 
k the spectral density has a pole independent of k and we can write 


limS^(k, z ) 
k-+ o 


Gbb{ 0 ) 

Z + T^ 1 ’ 


(10.453) 
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where the relaxation time, u is independent of k and can be expressed in terms 
of the memory function. This is definitely not hydrodynamic behavior. 

Let us now consider what happens when a symmetry is broken. Let Q be the 
generator of a symmetry transformation. It is a constant of the motion and 
therefore commutes with the Hamiltonian so [Q, H] = 0. The density, q(r), of Q 
will satisfy the microscopic balance equation 


dq 

ai + Vr 


J Q = 0. 


(10.454) 


Let us now consider two other densities, a and b, such that 
(l/ifc)[Q,fl(r)] = b( r). The densities, a and b, may or may not be conserved 
quantities. If we take the equilibrium expectation value of the commutator, 
[Q,fl(r)j, we find 


^Tr[p eq [Q,a(r')]\ = ~ JrfrTr[p4q(r),a(r')]] 

= ^prTr[a(r / )[p eg ,q(r)]] = (b( r')), 


(10.455) 


where p eq is the equilibrium state of the system. Notice that (b(r')) ^ 0 only if 
[p, q(r)] ^ 0. Thus even though Q commutes with the Hamiltonian, it may not 
commute with the state of the system. When this happens, the symmetry 
generated by Q has been broken. The quantity, (b( r)) is the order parameter for 
this broken symmetry. For a translationally invariant system we have 
(b(r) = bo , where bo is a constant. 

As we shall now show, the fact that (b(r)) ^ 0 means that there will be 
hydrodynamic waves in the density, (a(r, t)}. These waves are called Goldstone 
bosons. To see this, let us rewrite Eq. (10.455) as 


ih 


| drTr[p e g[q(r, /),a(r')]] 


= b 0 . 


(10.456) 


Then from microscopic linear response theory [cf. Eq. (10.239)], we have 

| driest - r',f) = K" a ( k = 0,1 ) = ~b 0 . (10.457) 

But since Q is a conserved quantity, K" a ( r — r, t) must be independent of time. 
The Fourier transform of Eq. (10.457) then yields 

Xq fl (k = 0, u) = i7rb 0 6(u). (10.458) 

Now let us assume that the limit in Eq. (10.458) is uniform. {A function, f ( k), is 
uniform at k= 0 if f (r) falls off rapidly enough so that J dr /( r) < oo. Then, 
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lim^o/lk) =/(k = 0) = Jdr/(r).} The limit in Eq. (10.458) is uniform: 
lim^ox" a (k,a>) = x" a (k = 0,o>) if Xq fl (r - r'.w) = (i7^)([q(r),fl(r / )]) 
vanishes rapidly enough as r — r 7 — ► oo. This will be true if q(r) and a(r) are 
local densities — that is, depend on field operators in the neighborhood of r and 
r 7 . This will be true for systems with short-range interactions. The fact that 
Xq a (r,a>) decays rapidly need not mean that x q fl (r,o;) decays rapidly. 

If q(r) and a( r) are local operators, we can write 

[im Xqa( k > w) = inb 0 6(cj). (10.459) 

k—> 0 M 

Thus, there is a mode whose energy goes to zero as k — > 0. Let us assume that 
this is true. Then 


Xqa ( k > <*>) = inbo 6 (a; - u>(k) ) , ( 10.460) 


where lim*_>ow( k ) = 0. The mode with frequency, a>(r), is the Goldstone 
boson. 

The spectral density function for the modes, d(k), can be written 


Saa ( k 5 z) 


Gga(k) 

Z + Maa( k, z)G- ! (k) 


(10.461) 


if we assume that (a(k)ja(k)) = 0. Since a(k) is not a conserved quantity, we 
will have 


lim/d( k ) | Q k - 1 | a(k)\ = <7 a , (10.462) 

*-°\ I z + iLQ k | / 

where <r a is a constant. The only possibility of obtaining a pole which goes to 
zero as k — ► 0 lies in the quantity G aa (k). Thus, we must determine how Goa(k) 
depends on k. We first use the Schwartz inequality to write 

(a(k)|a(k))(q(k)jq(k))>|(a(k)|q(k))| 2 . (10.463) 

Let us next use Eq. (10.129) and Exercise 10.5 to write 

G oa (k) = (a(k)Kk)) = ±Xo„(k,u, = 0) = (10 ' 464) 

Equations (10.463) and (10.464) yield the so-called Bogoliubov inequality, 

f dcdx'Jg(K u) f du) Xai^k.oS ) 1 

) 7T UJ J 7 T U> ~|J 7 T OJ \ 


(10.465) 
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Let us now note that since q is a conserved quantity, q = iuq and q = ik k • J q . 
Thus, 


tdcv Xg 2 

J 7T U> 



(10.466) 


(see Eq. (10.458)) and 

f dwXjgiKv) _ k 2 f duJ ^k Jqk-Jq ^ 
J 7T uj J 7r a ; 


(10.467) 


Thus, the Bogoliubov inequality yields 


Gaa( k) = 


A. 

k 2 Ra’ 


(10.468) 


where 


f d(x) ^k-J k-J 

R a < /Him (10.469) 

lc—*0 J 7T UJ 

is called the stiffness constant. A broken symmetry can give rise to a k ~ 2 
dependence in Gaa{ k), and the spectral density function for mode a(k) takes the 
form 


Saa (k, z) 


Gqg(k) 

z + k 2 R a (T a bQ 2 ' 


(10.470) 


Thus, a(k) is a hydrodynamic mode even though it is not a conserved quantity. 

As a consequence of this broken symmetry, the state is not described by the 
density matrix p eq = e~ 0H , where [H, Q] = 0. Instead it is described by a new 
density matrix for which [p eq , Q] ^ 0. We have already seen this in several 
examples in Chapter 7. In fact, one way to determine if a system has undergone 
a phase transition is to include the order parameter in the density matrix in a 
physically meaningful way. For example, apply an external field that couples to 
the order parameter. Then find a temperature-dependent equation for the order 
parameter. Above the transition temperature, the stable solution is the one for 
which the order parameter is zero when the external field is set to zero. Below 
the transition temperature, the stable solution is one with a nonzero order 
parameter when the external field is set to zero. 


j ■ EXERCISE SI0.10. A spin system has Hamiltonian H = 
~ where a and (3 denote lattice sites, the 

I coupling constant, /(|r Q — r^|), depends only on the magnitude of the dis- 
placement, r Q — rp, between sites a and (3, and S a and S Q are the spin 
| operators associated with sites a and (3. The total spin, S tot = J^ a S a , 
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commutes with the Hamiltonian so [H, S to ,] = 0. Show that if (S Z j OI ) — M 0 
and (S x jot) = (S y ,tot) = 0, the rotational symmetry of the system about the z 
axis is preserved but about the x and y axes it is broken (rotational symmetry 
about axis i is preserved if [p eq , = 0). 


Answer: First note that at a given site the spin operators satisfy commutation 
relation, [Si ^ a , = iflSijk Sk, a , £xyz ~ ^yzx — ^zxy — 1 1 ^-xzy — c-zyx — £yxz — 1» 

and £ijk = 0 otherwise. Spin operators belonging to different lattice sites 
commute. Note also that [5 X)tor , S yttot ] = ihS Z)tot . Thus, 

Mq — r Tv\p e q,S z jot\ = Tr \peqi [ Sx,tot) 


— ^ Tr[[p e ^, ^ Tr[[p g g, 


(1) 


Thus, if Mq is nonzero, then [p eq , S* w ,] ^ 0 and [p eq , S y j ol ] ^ 0. By a similar 
argument we can show that [p eq ,S Z! tot] = 0. Thus, the z-component of the 
total spin operator commutes with the state of the system and rotational 
symmetry of the state of the system about the z- axis is preserved. However, 
it is broken about the x and y axes. This means that the state of the system 
remains unchanged by rotating all spins about the z axis through the same 
angle. This will not be true if all spins are rotated about the x or y axis. (Note 
that we have neglected all the structure of the lattice that supports these 
spins.) 


■ EXERCISE S10.ll. At the A-transition in He 4 , gauge symmetry is 
broken. The generator of a gauge transformation is the number operator, 
N = JdrV>t(r)V>(r), where $(r) and i/>(r) are boson field operators which 
create and destroy particles, respectively, at point r (cf. Appendix B). Show 
that if [p eq ,N] ^ 0 so that gauge symmetry of the system is broken (where 
p eq is the equilibrium state of the system), then {^(r)) = Tr[/)^(r)] is the 
order parameter. 

Answer: First note that the commutation relations for boson field operators 
are given by 

Mr), p (<■')} = S(r - r'), [V’(r), V'(r')] = 0, and Mr), Mr')] = 0, 

( 1 ) 

The commutation relations of the field operators with the number operator 
are given by 


[N, </)(r)] = M( r ) and [/V, Mr)] = Mr). 


( 2 ) 
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Therefore, we can write 

(V’(r)} = Tr[p eg V>(r)] = -Tr[p eq [N,^(r)}] = Tr [[p eqj N]ij)(r)\. (3) 

Thus, when the gauge symmetry of the state of the system is broken, then 
{^(r)) is the order parameter. 
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PROBLEMS 

Problem 10.1. Assume that a viscous fluid flows in a pipe with a circular cross section 
of radius a. Choose the z axis to be the direction of motion of the fluid. Assume that the 
density of the fluid is constant and that the flow is steady. Starting from the 
hydrodynamic equations, find the equation relating pressure gradients to velocity 
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gradients in the fluid. Describe what is happening in the pipe. (Note: For steady flow, the 
velocity is independent of time, but can vary in space. At the walls the velocity is zero 
due to friction.) 

Problem 10.2. Consider an ideal fluid. Assume that at time t = 0 the Fourier amplitudes 
of the density, velocity, and temperature variations are given by Pk(0), Vk(0), and 7k (0), 
respectively. Compute Pk(0> v k(0» an d 7k(/). 

Problem 10.3. Consider a fluid of point particles. Assume that at time t = 0 the Fourier 
amplitudes of the coupled longitudinal density, velocity, and temperature variations are 
given by pk(0), vj[(0), and 7k(0). Compute fa(t), v il(0> an d 7k(/), but only keep terms in 
the amplitudes which are zeroth order in the wavevector, k. 

Problem 10.4. Consider an isotropic fluid whose deviation from equilibrium can be 
described by the linearized Navier-Stokes equations. Assume that at time t = 0 the 
velocity is v(r,0) = voe' a ^z. Compute Vjj- (/), where Vjj- (t) = Vk(0 - kvj |(t) and Vk(t) 
is the Fourier transform of v(r, t). 

Problem 10.5. Consider an isotropic fluid contained in a rectangular box with sides of 
length L x = L,Ly — 2 L, and L z = 3 L. Assume that the temperature of the fluid at time 
t — 0 has a distribution T(r, 0), but the fluid is initially at rest. Assume that the thermal 
expansivity of the fluid is very small so that coupling to pressure variations can be 
neglected, (a) Show that under these conditions the temperature variations satisfy the 
heat equation, dT(r,t)/dt — -KV 2 T(r,t). What is rc? (b) If the walls of the box 
conduct heat and are maintained at temperature, T 0 , approximatlely how long does it 
take for the system to reach equilibrium, (c) If the walls of the box are insulators, 
approximately how long does it take for the system to reach equilibrium? (Hint: No heat 
currents flow through the walls of insulators.) 

Problem 10.6. Consider an isotropic fluid of point particles, (a) Write the dyatic, V r v, 
in spherical coordinates, (b) Use your results in (a) to write the stress tensor in spherical 
coordinates. 


Problem 10.7. A Brownian particle of mass m is attached to a harmonic spring with 
force constant, k, and is driven by an external force, F(t). The particle is constrained to 
move in one dimension. The Langevin equation is 


+ + mu%x(t) = £(t) + F(t), 


dt 2 


dt 


where u>q = k/m, 7 is the friction constant, and f (f) is a Gaussian white noise with zero 
mean, (((t))^ = 0. Here ()^ denotes the average over values of the random force, (a) 
Compute the equilibrium correlation function, (x(f)x( 0 )), starting from the Langevin 
equation above with F(t) = 0. Let () T denote the thermal average over the initial 
position and velocity of the Brownian particle. Assume that (x(0)v(0)) r = 0 and 
(x(0) ) T = k B T/mujQ (cf. Exercise (5.5)). (b) The dynamic susceptibility for the 
Brownian oscillator is x(u>) = (-mu? + mu% - ryc^T 1 (cf. Exercise ( 10 . 6 )). Use the 
fluctuation-dissipation theorem, 


(x(t)x( 0 )) = p f -xMcos(wf), 


I7T 


J —00 ^ 


to compute the equilibrium correlation function. Do your results in (a) and (b) agree? 
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Problem 10.8. A paramagnetic system sits in a constant external field, Hqz, and 
therefore has constant magnetization, Mqz. Assume that this is the equilibrium 
configuration of the system and that a small uniform time- dependent magnetic field, 
H\{t) — Hicos(u>ot)x is applied to the system. The magnetization then becomes time- 
dependent. Its equation of motion can be written 

^ = -1 (M)xH(i)-D((M)) 

(these are called Bloch equations), where (M) = Tr(pM) = M x (t)x + M y (t) y+ 
( Mq — M z {t) z, H(t) = Hqz + Hi (t)x, and D((M)) is a damping term due to interactions 
between particles in the medium. The constant 7 equals gp/H, where g is the Lande 
^-factor and n is the magnetic moment of particles in the system, (a) Write the 
equations of motion for M x (t),M y (t ), and M z (t), assuming D x = M x /T 2 ,D y — M y /T 2 , 
and D z —M z /T\. The equations you derive will be nonlinear functions of Hi,M x ,M y , 
and M z . (b) Assume that H\,M x ,M y , and M z are small, so the equations can be 
linearized. If we define the response matrix, K(t), from the equation 
(M(r)) = J’^ o d/ / K(r - 1 1 )- H(t'), find Xxx{u) and K^t). Write the expression for 
Xxx(u) in the limits 1 /T\ — ► 0 and l/r 2 — > 0. (c) Compute the average power absorbed 
during the period T — Itt/ujq. (Note that x,y, and z are unit vectors in the x, y, and 2 
directions, respectively.) 

Problem 10.9. Consider a casual function, g(t), such that g(t) = 0 for t < 0 and 
g{t) / 0 for t > 0 . Let g L (z) — J 0 °° dte~ zt g(t) be the Laplace transform. Let 
gf-(cu) = 1°^. dte lult g{t) be the Fourier transform. Show that 



and show that the inverse transforms give the correct behavior for g(t). 

Problem 10.10. Consider an aqueous solution of salt, A^C^, which dissociates as 
^ v a A~ + v c C + , where v a and v c are the stoichiometric coefficients for the 
dissociation. The generalized Ohm’s laws for the diffusion currents are 

J a = -taaVfc -L cc Vj2 e c , and jf = -L ca Vjl e a -L cc Vfx e c , 

where Laa,L cc ,L ac and L ca are Onsager coefficients. Since L ac = L ca , only three 
are independent. Three independent transport coefficients can be measured experi- 
mentally for this system, and their values can be used to obtain values for the 
Onsager coefficients. The measurable quantities are the electric conductivity, 
k ~ I/E, the diffusion coefficient, D s , for the salt, and the Hittorf transference 
number, r a — ZaFJ® / IM a , which is the fraction of the current carried by one of the 
ions. Find expressions for (a) k, (b) r a , and (c) D s in terms of the three 
Onsager coefficients. (Assume uniform temperature and pressure throughout the 
system.) 

Problem S10.1. Compute corrections of order k to the spectral density function of an 
isotropic fluid of point particles. Estimate the size of these corrections relative to the 
dominant terms in a light scattering experiment. 

Problem S10.2. Consider a rough hard sphere of radius R immersed in an 
incompressible fluid of point particles. Assume the sphere is free to rotate about its 
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diameter but cannot translate. Also, assume that the sphere has angular velocity, 
fl(t) = (a) What is the velocity distribution of the fluid? (b) Let r{t) be the 

torque on the sphere due to friction between the sphere and the medium, and let f(u) be 
its Fourier transform. Show that 

/ k 2 R 2 \ 

where f2 w is the Fourier transform of 0(f). [Hint. The velocity is a polar vector and Q(t) 
is an axial vector, so assume the velocity has the form v = V x (g(r)Qoi), where g(r) is 
to be determined by solving Eq. (10.391). Assume stick boundary conditions on the 
surface of the sphere and assume the fluid velocity and all its derivatives go to zero at 
infinity.] 

Problem S10.3. Consider a particle undergoing rotational Brownian motion. Write an 
expression for the angular velocity autocorrelation function for the rotating Brownian 
particle. Show that for long times it depends on time t as follows: 



where po is the fluid density, 77 is the shear viscosity, 7 is the moment of inertia of the 
sphere, and (|fl(0)| 2 ) r is the initial mean square angular velocity. (Hint: Use the result 
of Problem S 10.2 for the friction coefficient.) 

Problem S10.4. Consider a membrane of thickness Ax which separates two regions 
containing dilute aqueous solutions of ions. One particular ion, i, with charge Zi can 
diffuse through the membrane. It has diffusion coefficient, D, inside the membrane. A 
constant electric field, E, directed perpendicular to the membrane surface exists inside 
the membrane. Assume that a steady current, /„ of ions diffuses across the membrane. 
The phenomenological equation relating the current to the gradient of the 
electrochemical potential of the ion inside the membrane is /, = L(dp e Jdx), where L 
is the phenomenological coefficient, L = Dci/RT , and c, is the concentration of ion i. 
For dilute solutions, the gradient of the electrochemical potential of the ion is 
p* = ( RT/cj)(dci/dx ) + Zi F(d(f)/dx), where 0 is the electric potential, F is Faraday’s 
constant, and E — —(d(f>/dx). Show that the ion current inside the membrane can be 
written 

Dzi FA <f> ( cf - c*exp(ziFA<f>/RT)) 

' “ RTAx (1 - exp(zi¥A<t>/RT)) ’ 

where cf and cf are the ion concentrations on the inside left and right surfaces of the 
membrane, respectively, and A <f> = 4 >r — 4>l is the difference in electric potential on the 
inside left and right surfaces of the membrane. 

Problem S10.5. The entropy production, a, in a thermocouple can be written 
To = -J s • V,T - I • (V r (^) - FE), 

where Js is the entropy current, I is the current carried by the electrons, F is Faraday’s 
constant, E is the electric field in the metal wires, and /x e i is the chemical potential of the 
electrons. The balance equation for the entropy /volume, s, is ds/dt = —V • Jj + a. The 
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generalized Ohm’s laws can be written 

Js = -^V r 7- + El and ( E - V, 0?) ) = -< V,T + «I, 

where A is the coefficient of thermal conductivity at zero electrical current, R is the 
isothermal electrical resistance, £ is the differential thermoelectric power, and T/T is the 
entropy transported per unit electric current, (a) Show that the Onsager relation, 
Lse = L E s, implies that T = T£. (b) Show that the entropy balance equation can be 
written ds/dt = (A /T)Vl(T) - V r • (n/7) + RI 2 /T. The first term on the right is the 
entropy production due to thermal conductivity. The third term on the right is the 
entropy production due to Joule heating. The second term on the right is the entropy 
production due to the Peltier and Thomson effects. 

Problem S10.6. Consider the thermocouple shown in Fig. (10.4). Replace the potentio- 
meter with a capacitor held at temperature 7o with wire of metal B attached at points a 
and d. Show that 


* j. 

where A<t>ad = 4>a - <f>d is the potential difference across the capacitor and (a and £ B are 
the differential thermoelectric powers in the two metals (cf. Problem S10.5). Note that 
in this circuit, there is no electric current. 

Problem S10.7. The junction between two metal wires, A and B, in a thermocouple has 
a circular cross-sectional area, tt p 2 , where p is the radius of the wires. Assume that the 
transition between the two metals takes place smoothly over a distance, A L, so the 
transition region (the junction) has a volume, np 2 AL. Assume that the temperature is 
uniform throughout the junction and that an electric current, I, is flowing through the 
junction. From Problem S10.5 the entropy production, a (entropy/volume • time), is 
given by 



In the limit AL — > 0 , show that the Peltier heat, tt which is just the entropy/ 
(area • time • current) produced (or lost) in the junction, is given by tt = (T# - Ta). 

Problem S10.8. Consider the flow of He(II) through a porous material. Only the 
superfluid can flow so v n = 0 . Also, the porous material exchanges momentum with 
fluid so that momentum is not conserved and the momentum balance equation cannot be 
used. Use the linearized nondissipative superfluid hydrodynamic equations to determine 
the dispersion relati on for density oscilla tions (fourth sound). Show that fourth sound 
has a speed c 4 = > J{p n /p)c\ + ( p s /p)c\ where cj and c 2 are the speeds of first and 
second sound, respectively. Assume that ( dP/dT) p = 0 . 

Problem S10.9. Derive the wave equations for first sound and second sound in a 
nondissipative superfluid for the case when ( dP/dT) p = 0 Show that only second sound 
propagates when the momentum density is zero. Show that only first sound propagates 
when the super fluids and normal fluids move in phase so v„ = v*. 

Problem S10.10. Consider a magnetic crystal whose magnetic interactions are governed 
by the Hamiltonian, H = - 3 X^/ 3 ^ I r « ^ r ,3l) s a • S^, where a and (3 denote lattice 
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sites and the coupling constant, /(|r a - r^J), depends only on the magnitude of the 
displacement, r a - rp, between sites a and ( 3 . The i th component (i = jc,y, z) of the 
magnetization operator is M,(r) = ^2 a S i<a 6(r - r a ), where r a is the position operator 
for the a* lattice site and Q is the I th component of the spin operator for that site. The 
total spin of the lattice can be written S,>, = JdrM,(r). The total spin operator 
commutes with the Hamiltonian, so the spin density (the magnetization operator) 
satisfies the balance equation 


dt 


+ ]C Vr • y (t( r »0 = °- 

j 


where 7y( r, t) is a magnetization current. Consider only long- wavelength modes so the 
discreteness of the lattice can be neglected. Assume that ( S zlot ) = Mq and 
(S x ,tot) ~ ( $x,tot ) = 0. Compute the spectral density matrix 

/ $m x m x (k, z) S\f x M y (k, z ) \ 
z) SM y M y (k,z) ) 

and determine its poles to lowest nonzero order in k. Describe the hydrodynamic 
excitations in this system. What is unusual about them? 
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TRANSPORT THEORY 


11. A. INTRODUCTION 

In this chapter we will consider the microscopic theory of dilute gases which 
are close to equilibrium. We assume that all disturbances are slowly varying in 
space and have small amplitude. Thus, in any small region the system will be in 
equilibrium and its state in that region can be specified by thermodynamic state 
variables. However, the values of thermodynamic variables can vary from one 
region to another in the fluid. 

When a system is disturbed from its equilibrium state, all quantities which 
are not conserved during collisions decay rapidly to their equilibrium values. 
After a few collision times, only quantities which are conserved during the 
collisions remain out of equilibrium. The densities of conserved quantities 
entirely characterize the nonequilibrium behavior of the fluid after long times. 
The equations of motion for the densities of conserved quantities are called the 
hydrodynamic equations. Examples of conserved quantities are the number of 
particles, the total momentum of the particles, and the total kinetic energy of the 
particles (for elastic collisions). If there are inhomogeneities in the densities of 
these quantities, then particles, momentum, or kinetic energy must be 
transported from one part of the fluid to another to achieve equilibrium. The 
rate at which a fluid returns to its equilibrium state is determined by the 
transport coefficients. They are the constants of proportionality between the 
gradients in the densities of conserved quantities and the currents that result 
from those density gradients and lead the system back to equilibrium. The aim 
of transport theory is to compute the transport coefficients in terms of the 
microscopic properties of the fluid. 

In order to build intuition concerning the nature of transport processes, we 
will first derive expressions for the coefficients of self-diffusion, shear viscosity, 
and thermal conductivity using the simplest possible mean free path arguments. 
The expressions we obtain make it possible to estimate the order of magnitude 
of the transport coefficients by simple “back of the envelope” calculations. 

To obtain a deeper understanding of transport phenomena, we must derive 
the transport coefficients from a more rigorous microscopic theory. That is, we 
must derive expressions for them starting from the kinetic equation for the fluid 
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(cf. Section 6.C). However, we are then faced with one of the fundamental 
dilemmas of statistical physics. The exact kinetic equation derived in Chapter 6 
is reversible, whereas transport processes are irreversible or dissipative. 
Boltzmann was the first to derive an irreversible kinetic equation to describe 
relaxation processes in fluid. He did it by using probabilistic arguments. Thus, 
his equation is not based rigorously on the properties of the underlying 
dynamics of the system. However, the Boltzmann equation works extremely 
well in providing numerical values for the transport coefficients in dilute fluids 
and to this day remains one of the great milestones in the history of statistical 
physics. Much of this chapter is devoted to a study of the Boltzmann equation. 
We will derive it using arguments similar to Boltzmann’s original arguments, 
and we will indicate its irreversible nature by proving the famous Boltzmann H 
theorem. From the H theorem, Boltzmann was able to obtain a microscopic 
expression for the entropy of a system near equilibrium which has the proper 
behavior as the system approaches the equilibrium state. 

We will apply Boltzmann’s equation to the case of dilute two-component gas 
of particles which have different identity but are dynamically the same. For this 
system, we derive microscopic expressions for the coefficients of self-diffusion, 
shear viscosity, and thermal conductivity. The method we use to derive the 
transport coefficients is very simple and elegant and was first introduced by 
Resibois. We first derive the hydrodynamic equations from the Boltzmann 
equation, introducing into them the transport coefficients, and we then find the 
normal mode frequencies of the hydrodynamic equations in terms of the 
transport coefficients. We next find the hydrodynamic eigenfrequencies of the 
Boltzmann equation and match these to the normal mode frequencies of the 
hydrodynamic equations. This gives us the desired microscopic expressions for 
the transport coefficients. 


ll.B. ELEMENTARY TRANSPORT THEORY [1, 2] 

Before we discuss the full microscopic theory of transport processes based ont 
he Boltzmann equation, it is useful to derive the transport coefficients using 
very simple mean free part arguments and kinetic theory. These arguments are 
entirely probabilistic and are independent of the form of the interparticle 
potential, although they do assume that the interaction is very short-ranged 
compared to the average interparticle spacing. As we will see, the same type of 
arguments can be used to obtain microscopic expression for the reaction rates in 
chemical systems. 

11.B.1. The Maxwell-Boltzmann Distribution 

Let us consider a dilute gas of N particles contained in a volume V. We assume 
that the gas is in equilibrium and that the interaction energy between particles is 
negligible compared to the kinetic energy of the particles. The probability 
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density of finding the particles with phase space coordinates in the interval 

pV pjv _j_ ^p n anc j qW _> qW _|_ ^ j s 


p(p N ,q N ) 


exp(-/3Eti^) 

J^P" J^exp(-/»E«£) ’ 


( 11 . 1 ) 


where dp N dq N = dp x x • • • x dp N dq x x • • • x d The probability density, 
Pi (p.). that particle 1 has momentum, in the interval, p, — *• p! + dp x , is 
obtained by integrating over all position variables, and the 

momentum variables, p 2 , . . . p N , 


Pi(Pi)= ...J^P 2 


(3 V /2 

) exp( - 


2tt m 


x dp N dq x x 

to? 

2m 


xdqfip(ff,q N ) 


( 11 . 2 ) 


Similarly, we can write an expression for the normalized probability density, 
F(v!), of finding a particle with velocity, Vi — ► Vi + d\ x , where Vi = p x /m. It 
is given by 


F(V|) = 



(11.3) 


and is normalized to one when we integrate over Vi. Equation (11.3) is called 
the Maxwell-Boltzmann distribution. 


11.B.2. The Mean Free Path 

The mean free path, A, of a particle is the average distance it travels between 
collisions. We shall assume that collisions occur at random in a gas. We wish to 
find the probability that a particle can travel a distance r without collision. 
Because the collisions occur at random, a particle has the same chance of 
collision in any interval r — > r + dr. If the average number of collisions per unit 
length is 1/A, then the probability that a collision occurs in an interval dr is 
dr/ A. 

Let P 0 (r ) denote the probability that no collision occurs in an interval of 
length r. Then the probability that no collision occurs in interval of length 
r + dr is 

P 0 (r + dr) = P„(r)(l-~\ (11.4) 

The factor (1 — (dr/ A)) is the probability that no collision occurs in interval dr. 
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We multiply P 0 (r ) and (1 — (dr/ A)) together because the events “collision in 
length r” and “collision in length dr ” are independent. If we expand the left- 
hand side of Eq. (11.4) in a Taylor series, we obtain 

J r p o(r) = (11-5) 

and therefore 

P 0 (r) = e~'l\ (11.6) 

The probability of no collision in length of path r is a Poisson distribution. The 
probability that a particle suffers its first collision in an interval r r + dr is 
P 0 (r)(dr/X). The average distance traveled between collisions is 

(r)= Pr/> 0 (r)Y = A > (H-7) 

Jo A 

which is just our original definition of the mean free path. 


11.B.3. The Collision Frequency 

Let us consider a fluid containing several types of particles A, B, C, ... and so 
on. Let us assume that particles A have mass m A , a diameter d A , and a number 
density n A \ particles B have a mass m B , a diameter d B , and a number density n B \ 
and so on. Let us further assume that the particles are distributed at random in 
the fluid and that each type of particle is distributed in the fluid according to the 
Maxwell-Boltzmann distribution [cf. Eq. (11.3)]. 

We will find the collision frequency between particles A and B. The radius of 
the sphere of influence between A and B is 

d Aa = ^ (11.8) 

(cf. Fig. 11.1). The average relative speed between particles A and B is given by 
(v r ) AB = J J dy B F(y A )F(y B )\y A -\ B \. (11.9) 


The center-of-mass velocity of particles A and B is 


V™ = 


fn A y . 4 + mb^b 
m A +m B 


( 11 . 10 ) 


and the relative velocity is 


y r =V A - y B . 


(li.ii) 
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Fig. 11.1. The circle of radius d AB is the sphere of influence for particles A and B. 


The Jacobian of the transformation from coordinates \ A and v B to 
coordinates v r and \ cm is equal to one. Thus, {v r ) AB can be rewritten 

M AB = ^^) V1 ^^) V1 \\ dy,dVcmVre ' W2)( "" V ~ + ' W \ (1U2) 

where M AB — m A + m B is the total mass and fi AB = m A m B /(m A + m B ) is the 
reduced mass. If we perform the integrations in Eq (11.12), we obtain 


( V r)AB ~ 



(11-13) 


for the average relative speed of particles A and B. 

Let us now assume that all particles of mass m B in the fluid are at rest and a 
particle of mass m A moves through the gas with a speed {v r ) AB - Particle A 
sweeps out a collision cylinder of radius d AB (radius of the sphere of influence) 
and volume n d AB (v r ) AB t in time t. The number of particles B that particle A 
collides with in time t is f AB t, where f AB is the collision frequency 

Iab = n B ^d AB {v r ) AB . (11.14) 


Therefore, the total number of collisions per unit volume per second, u AB , 
between particles of type A and of type B is 


vab = nAn B 7rd AB (v r ) AB = n A n B nd AB 



(11.15) 


From Eq. (11.15) we can easily write down the collision frequency v/^ between 
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identical particles: 

V M = Wa^AA^AA = 'W^Ia 


/ 16 kBT \' /2 

V ™ A ) 


(11.16) 


The extra factor of \ enters because the particles colliding are identical and it is 
needed to prevent overcounting. 

If we consider a gas of identical particles, then the mean free path, A, the 
collision frequency for a single particle, /aa, and the average speed, (v), are 
simply related by the equation 


A = ^ = (v>r 

JAA 


1 


VlriAnd^ 


(11.17) 


where r is the collision time (the time between collisions), and the average 
speed, (v), is related to the relative speed by the relation (v re i) = V2(v). 


11.B.4. Self-Diffusion 

We can now use the above ideas to obtain an expression for the coefficient of 
self-diffusion, D. Let us consider a gas of particles which are identical in every 
way except that a small fraction of the particles have a radioactive tracer 
attached (or they might have different color or different spin). Self-diffusion 
involves the transport of tracer particles through a gas of otherwise identical 
particles. If initially there is an uneven distribution of tracer particles in the gas, 
the distribution will even out and become uniform through the diffusion 
process. The rate at which inhomogeneities in the density of tracer particles 
become smoothed out is determined by the value of the coefficient of self- 
diffusion, D. 

We shall assume that the density of tracer particles, nj(z), varies in the z 
direction, while the total particle density, n, is held constant. As a first step in 
obtaining the coefficient of self-diffusion, we draw an imaginary wall in the 
fluid at z — 0 and find the net flux of particles across the wall. 

Let us first find the number of particles which hit the wall from above. We 
will look at a segment of the wall, dS, and choose the origin of coordinates to be 
at the center of dS. We next consider a volume element, dV, of the gas located at 
position r, 9, <f> above the wall (cf. Fig 11.2). The average number of tracer 
particles undergoing collisions in dV per unit time is fn T {z)dV = 
(( v )/^) n T{z)dV, where /is the collision frequency. Particles in dV leave in 
random directions (any direction is equally likely). The fraction of particles that 
move towards dS is dCl/4n, where dfl is the solid angle subtended by dS and 
dfi, = d'Sjcos 6\/r 2 . Not all particles particles leaving dV in the direction of dS 
reach dS. The probability that a tracer particle will reach dS is e~ r ! x (the 
probability that it will not undergo a collision). Combining the above results, we 
obtain the following expression for the number of tracer particles, dn T (r), 
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y 

Fig. 11.2. Only a fraction of the particles in 
volume element, dV, reach the surface area 
element, dS, without undergoing a collision en 
route. 

which collide in d V, leave directed toward d S, and reach d S without another 
collision: 

(11.18) 

The total number hitting a unit area of wall per unit time from above, N + , is 
found by integrating over the entire volume for z > 0: 

/ v ) f°° „ C n/2 2ix e~ r / x 

N + =—— r^dr sin Qd6 dcf)n T (z)cosO — =— . (11.19) 

4ttAJ 0 Jo Jo r 

For the case in which the tracer particles are distributed uniformly throughout 
the gas — that is, n(z) — constant, — Eq. (11.19) reduces to N + — n(v)/ 4. 

For small variations in tracer density we may expand nr{z ) in a Taylor series 
about the origin, 

" t{z) = " r(0) +z (c£) 0 + t ft?) / " " (11 ' 20) 

If n r (z) is a slowly varying function of z, then higher-order derivatives 
(d 2 riT/dz 2 ) 0 , ( d 3 nr/dz 3 ) 0 , and so on, will be small. Because of the factor e~ r ^ x 
in the integral in Eq. (11.19), only small values of z (values of z « A) will 
contribute. Therefore, we can terminate the expansion at (z 2 /2)(d 2 n T /dz 2 )o- 
The net transport of tracer particles in the negative z direction across a unit 
area of the wall per unit time is given by (N + — AL), where AL is the number 
crossing a unit area per unit time in the positive z direction. The expression for 
N _ is the same as for N + except that 9 is integrated from 6 = (tt/ 2) — > 7 r and 
|cos#| is changed to — cos#. Therefore, 

(v) f 00 . f 7r C 2n e~ r/x 

(N + - N-) = J ^ rr2 J sin^d'^j d(f>n T (z)cos 6—^— . 



( 11 . 21 ) 
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If we substitute Eq. (11.20) into Eq. (11.21), the first and third terms are 
identically zero and we obtain 


(N + -N_) = 


(v)A 

3 



(11.22) 


If the density increases in the z direction, then (dn T /dz ) 0 > 0 and 
(N + — AL) >0. Therefore, there will be a net transport of particles in the 
negative z direction. 

If we let Jd(z ) denote the number of tracer particles crossing a unit area 
at z in unit time in the positive z direction [Jd(z) is the particle flux or current], 
then 


where 


Mz) = -D 


dn T (z) 

dz 


(11.23) 



(11.24) 


is the coefficient of self-diffusion. If the density is a slowly varying function of 
x, y, and z, then we can write the current as in the more general form 


J D (r) = -DV r n r (r), (11.25) 


where V r denotes the spatial gradient. Equation (11.25) is called Fick’s law. 

Let us now consider diffusive flow which is changing in time. We assume 
that n T = n T (z,t) and Jo = Jd{z , t) and consider a region in the fluid bounded 
by two fixed planes, one at z and the other at z + dz . The net increase in the 
number of particles per unit area per unit time between the planes is 

J D (z + dz, t ) - J D (z, t ) = dz = ~§^{ D ~ dz ‘ ( ' 1 1 - 26 ) 


If there are no sources or sinks of particles between the planes (particles are 
conserved), then Eq. (11 .26) must equal the rate of increase of number of tracer 
particles between the planes. Thus, 


drir _ n d 2 nT 
dt dz 2 


(11.27) 


We would obtain the same equation if we assumed that the particles performed 
a random walk through the gas (cf. Chapters 4 and 5). For variations in three 
dimensions, Eq. (11.27) can be written 


dn T (r,t) 


£>V r V(r, t ). 


dt 


(11.28) 
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Thus, using kinetic theory, we have derived again the diffusion equation and we 
have obtained an expression for the diffusion coefficient in terms of the mean 
free path. We can now use these results to find an expression for the coefficients 
of viscosity and thermal conductivity. 

11.B.5. The Coefficients of Viscosity and Thermal Conductivity 

In addition to self-diffusion, other types of transport can occur in a fluid. If a 
fluid is stirred so that one part moves relative to another part (that is, if the 
average velocity of particles is a function of position (v) = (v(r))), then 
“friction” (viscosity) between the moving parts will bring them to equilibrium 
and the average velocity of the particles will become uniform throughout the 
gas. The quantity that is transported is the average velocity of the particles. The 
rate at which the average velocity is transported is determined by the coefficient 
of viscosity. 

If one part of the fluid is hotter than another part, the particles in the hot 
region will have a greater average kinetic energy than particles in the cooler 
region. Transport processes will equalize the average kinetic energy 
(temperature) in the two regions. The quantity transported is the average 
kinetic energy and the process which brings the gas to equilibrium is called heat 
conduction. The rate at which the temperatures in different parts of the gas are 
equalized is determined by the coefficient of thermal conductivity, K. 

It is possible to treat all these transport processes in a unified manner. Let us 
assume that A = A(z) is the molecular property to be transported and that it 
varies in the z direction. Let us draw an imaginary plane in the gas at z = zo- 
When a particle crosses the plane, it transports the value of A it obtained in its 
last collision and transfers it to another particle in its next collision. 

Let A(zo + Az) be the value of A transported in the negative z direction 
across the plane; A z = a\ is the distance above the plane where the particle had 
its last collision (cf. Fig. 1 1.3; A is the mean free path, and a is a proportionality 
constant). The average number of particles crossing the plane per unit area per 
unit time is n(v). The net amount of A transported in the positive z direction per 
unit area per unit time is 


dA dA 

n(v)[A(zo - Az) - A(zo + Az)] = -2n(v)Az— = -2an(v)X — . (11.29) 

dz dz 

Let us denote the net amount of A per unit area per unit time (the current) by J A . 
Then 

dA 

J A (z) = -b A n(v) A — , (11.30) 


where b A is a proportionality constant. We can determine b A from our 
expression for the coefficient of self-diffusion in Eq. (11.24). Let us now apply 
Eq. (11.30) to the cases of self-diffusion, viscosity, and heat conductivity. 
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z 



Fig. 11.3. A particle has a collision a distance A z above the plan and transports 
property A to another particle a distance A z below the plane. 


ll.B.5.1. Self-Diffusion 

Let us again consider the system in Section 1 l.B.4. The concentration of tracer 
particles per particle is A = nr(z)/n. If the density of tracer particles varies in 
the z direction, then there will be a concentration gradient ( \/n)(dn T /dz ) = 
dA/dz causing them to diffuse through space. If we let J A = J D , where J D is the 
tracer particle current, then from Eq. (11.30) we obtain 


Jd(z ) = -b A (v) A 


drir 

dz 


-D 


dn j 
dz 


(11.31) 


and the coefficient of self-diffusion is given by D = (v)A/3, if we let b A = |. 


ll.B.5.2. Viscosity 

If a gas is stirred, one part will move relative to another part. Let us assume that 
the y component of the average velocity varies in the z direction. Then 
A(z) = m(vy(z)) and J A = J^, where is the net flux of y component of 
momentum per unit area per unit time in the z direction. From Eq. (1 1.30) we 
have 


where 


'zy 


3 w dz dz 


r) = -nm(v)X 


(11.32) 


(11.33) 


is the coefficient of viscosity. From Eq. (1 1.17) we know that A = [y/lnird 2 ] 1 
for hard spheres of diameter d. Therefore, 


V lyflvd 1 


(11.34) 
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and the coefficient of viscosity is independent of density — a somewhat 
surprising result that is verified by experiment. 

ll.B.5.3. Heat Conduction 

If the temperature of the gas varies in the z direction then 

A(z) =\m{v 2 (z)) =\k B T(z) (11.35) 


and Ja = Jq is the heat current (the net flux of thermal energy per unit area per 
unit time). Since b A = ^ Eq. (11.30) gives 

Jq = ~ K Tz' (1U6) 


where 


v _ n(v)\k B 

k ~~t~ 


(11.37) 


is the coefficient of thermal conductivity. 

The coefficient of thermal conductivity can be put in another form. The 
change in average internal energy per particle as a function of z can be written 
A u(z) = mc v T(z), where c v is the specific heat. Then 


nm(v)Xc v 
K= ~ 


(11.38) 


If we compare Eqs. (1 1.33) and (1 1.38), we obtain the following simple relation 
between the coefficients of viscosity and heat conductivity and the specific heat: 


K 

1 


Cy 


(11.39) 


The above expressions for the transport coefficients have been derived by very 
simple arguments, but they do describe fairly well the observed qualitative 
behavior of transport coefficients in dilute gases. 


11.B.6. The Rate of Reaction [3] 

We are often interested in transport processes in systems in which chemical 
reactions can occur. We can use elementary kinetic theory to obtain a qualitative 
expression for the rate of the reactions. We cannot simply equate the number of 
collisions between various molecules to the rate at which they undergo 
chemical reactions. A simple example will illustrate this. Let us consider the 
reaction 2HI — > H 2 + h- The radius of the sphere of influence may be obtained 
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activation energy 



Fig. 11.4. There is an energy barrier that must be surmounted before a chemical 
reaction can occur. 


from viscosity data on HI gas (d = 4 x 10~ 8 cm). At a temperature T = 700K, 
pressure P = 1 atm, the collision frequency is easily computed and yields 
v {mf — 1-3 x 10 28 /sec. If every collision between HI molecules contributed to 
the reaction, then for a gas containing 10 23 molecules the reaction would be 
completed in a fraction of a second. However, experimentally one finds that it 
takes a much longer time to complete the reaction, the reason being that not all 
collisions lead to a reaction. 

In order to cause a reaction of the type A + B — ► C + D, a certain amount of 
the kinetic energy of A and B must be absorbed during the collision. A and B 
first form an intermediate state (AB), which then can decay into the products C 
and D. The intermediate state (AB) is called an activated complex and A and B 
are called the reactants. The amount of energy which must be absorbed in order 
to form the activated complex is called the activation energy (cf. Fig. 11.4). 

All of the energy which goes into exciting the activated complex must come 
from the energy of relative motion of the reactants. Energy in the center-of- 
mass motion cannot contribute. If e is the activation energy of the activated 
complex, then a reaction between A and B can occur only if the relative velocity 
of A and B is such that 


\VABV 2 r >e. (11.40) 

Therefore, to find the rate at which reactions between A and B can take place, 
we must multiply the collison frequency between A and B by the probability 
that A and B have a relative velocity greater than y/2e/p A s- 

The probability that the molecules A and B have center-of-mass velocity 
in the range V cm — > V cm + d\ cm and a relative velocity in the range 
v r — > v r -f- d\ r is given by 


F(V cm , Vr)dy, cmdy r = 


'(3Mab\ 3/2 ( Ppab \ e -m)(M AB v c l+» ABV ?) d y cm dYt 
2n J \ 2n ) 


(11.41) 
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The probability that A and B have a relaive velocity in the range v r — > v r + d\ r 
is found by integrating Eq. (11.41) over the center-of-mass velocity. We then 
find 


P(\ r )d\ r = e ^ 2 ^ ABV '^d\ r . (11.42) 

The probability that A and B have a relative v elocity v r > y/2e/ fi AB is found by 
integrating Eq. (11.42) from v r = \f2ef fi AB to v r = oo and integrating over all 
angles. Thus, 





(11.43) 


=/(/?,£)«-*. 


The probability that a reaction takes place depends exponentially on the 
activation energy. The quantity f((3e) is a function of temperture and activation 
energy. Its form is not important here. 

We can now write the following qualitative expression for the rate of reaction 
of A and B. The number of reacting molecules, Nr, per second per unit volume 
is 


N R = K(0e)v AB e- f,t , (11.44) 

where u AB is the collision frequency between molecules A and B, and the 
coefficient K(/3e) depends on activation energy and temperature and may also 
depend on the geometry of the interacting molecules A and B. We may rewrite 
Eq. (11.44) in the form of a rate equation. If we let n A denote the number of 
moles of A per unit volume and use Eq. (11.15), we can write 

= ~K'(e, (3 , fi AB)d AB e~ Pe n A n B = -k AB n A n B . (11 .45) 

In Eq. (11.45), K'(e(3^ AB ) is a function of e,(3, and \i AB and may depend on 
the geometry of A and B. The quantity k AB is called the rate constant for the 
reaction. It depends exponentially on the activation energy. We see that the 
activation energy is the most important quantity in determining the rate of a 
chemical reaction, since a small change in e can cause a large change in k AB . 
Equation (11.45) gives the rate of decrease of A in the fluid due to the reaction 
A + B — ► C + D. It is a second-order rate equation because it depends on the 
square of the molar density of species in the fluid. It is the simplest form of rate 
equation one can have because the rate of change in the concentration of A is 
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determined by only one reaction. There will often be a number of competing 
reactions taking place in the fluid which determine the rate of change in the 
concentration of a given type of molecule. Then the rate equation for a given 
reaction can be a more complicated function of the concentrations. 

In general, the expression describing a given chemical reaction is written in 
the form 


ki 

-v a A - v b B ^ v c C + u d D 
*2 


(11.46) 


where are the stoichiometric coefficients and tell us how many molecules of 
each species must combine to enable the reaction to take place. By convention, 
the stoichiometric coefficients on the left-hand side are negative and those on 
the right-hand side are positive. The constant k\ is the rate constant for the 
forward reaction and k 2 is the rate constant for the backward reaction. The rate 
of change of A can, in general, be written 

= -kin^nfg^ + k 2 n%n% (11 .47) 

where \v A \ denotes the absolute value of v A . 

When more than one reaction contributes to the rate of change of a given 
species, the rate equations can become more complicated. For example, the 
simple process A — > B + C cannot be explained using collision theory. One 
must introduce an intermediate in the process. Consider the following two 
reactions: 


2 A^A+A* (11.48) 

ki 

and 

A*^B + C. (11.49) 

Two molecules, A, collide to form one A an an excited A*. A* then decays into B 
and C. The rate equations for this process may be written 


dn A 

dt 


-k\n 2 A + k 2 n A n A * , 


(11.50) 


dn A * 

dt 


= k\n 2 A — k 2 n A n A * - k 3 n A *, 


(11.51) 


and 



(11.52) 
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We can solve Eqs. (1 1.50)— (1 1.52) if we make the assumption that the 
concentration of A* is roughly constant — that is, that the rate of formation of A* 
is equal to the rate of decay of A*. Then 


^ = 0*n A .= A . 

dt kj + k 2 riA 


(11.53) 


With Eq. (11.53), we obtain the following equation for the rate of increase of 
n B : 


dn B __ ki k 3 n 2 A 
dt ^3 + k 2 n A 

If k 3 > k 2 n A , then dn B /dt « k x n 2 A and the rate of formation of B looks as if it 

were governed by the process 2 A^>B + C. If k 2 n A > £ 3 , then dn B /dt = 
(kik 3 /k 2 )n A and the rate of formation of B looks as if it were governed by the 

process A -+B + C, where k' = k\ki/k 2 . 

In general, more than one chemical reaction will contribute to the rate of 
formation or depletion of a given molecule. As we can see from the above 
example, experimental measurement of the rate equations is not always 
sufficient for determining the type of chemical processes which are occurring in 
a fluid. 

Now that we have built some intuition about the transport coefficients, we 
will derive microscopic expressions for them based on a more rigorous 
microscopic theory — that is, one based on the Boltzmann equation. 


ll.C. THE BOLTZMANN EQUATION [4-8] 

Until now, we have treated transport processes from a phenomenological point 
of view and we have obtained very simple expressions for the transport 
coefficients which are independent of the detailed form of the interaction. Most 
of the rest of this chapter will be devoted to a derivation of the transport 
coefficients starting from a kinetic equation similar to that in Eq. (6.75), which 
is an exact equation for the one-body reduced probability density. However, Eq. 
(6.75) cannot be solved exactly. In practice, we try to obtain an approximation 
to it which contains the important physical features of the system we are 
considering. In subsequent sections, we shall be interested in one of the 
simplest types of nonequilibrium systems — a dilute gas of particles which 
interacts via a short-range, spherically symmetric interaction potential. We shall 
assume that inhomogeneities in the gas are slowly varying. The form of the 
kinetic equation which gives the time evolution of the probability density of 
such a system is called the Boltzmann equation. Although it can be derived 
directly from Eq. (6.75) the arguments are rather long and tedious. In this 
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section, we shall derive the Boltzmann equation from simple physical 
arguments. 

Let us consider a dilute gas of particles of mass, m, which interact via a 
spherically symmetric potential </>(jq, - q ; |). We shall assume that the time a 
particle spends between collisions is very much longer than the duration of a 
collision. We shall describe the behavior of the system in terms of a number 
density /( p, q, t) rather than a probability density. The distribution function, 
/(p,q,f), gives the number of particles in the six-dimensional phase space 
volume element p — * p + dp, q — ► q + dq. It is related to the reduced 
probability density p\ (p, q, t ) and the distribution function F\ (p, q, t) through 
the relation 


/(P,q.O = Wpi(p,q,t) =^F,(p, q, t) (11.54) 

(pi(p,q,t)) and F { (p,q, t) are defined in Section S6.A). 

Let us now consider a volume element, AVi = dp } Ar, lying in the region 
Pi -^Pi+rfpi, r— »r + Ar of the six-dimensional phase space. We shall 
assume the volume element Ar is large enough to contain many particles and 
small enough that the distribution function /(p,r,t) does not vary appreciably 
over Ar. We wish to find an equation for the rate of change of number of 
particles in AVi. This change will be due to free streaming of particles 
into (and out of) AV! and to scattering of particles into (and out of) AVi 
because of collisions. The rate of change of/(p,,r, t) may be written in the 
form 


d A 

dt 


1 <9qj dt 


collisions 


(11.55) 


The first term on the right is the contribution due to streaming (flow through the 
surface of A VO, and the second term is the contribution due to collisions. If an 
external field were present, then an additional streaming term of the form 
-Pi • d/i/dp! would be present on the right. 

We shall assume that the gas is dilute enough that only two-body collisions 
need be considered. Furthermore, we shall assume that all collisions are elastic. 
Before we can write an expression for <9//<9t| coI1 , we must review some aspects 
of two-body scattering theory. 


11.C.1. Two-Body Scattering [9] 

In this section we shall review those aspects of two-body scattering theory 
necessary to derive the Boltzmann equation. We first discuss the dynamics of 
the scattering process and then derive an expression for the scattering cross 
section. We shall restrict ourselves to elastic scattering processes. 
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11. C. 1.1. Dynamics of the Scattering Process 

We consider two particles, one with mass m\ and displacement rj (t) at time, t, 
and another with mass ra 2 and displacement r 2 (t) at time t. The particles 
interact via a short-range interaction, V(|r 2 — r 2 |). It is useful to introduce the 
relative displacement, r = r 2 — r } , and the center-of-mass displacement, 
R = ( m\/M)r\ + (m 2 /M)r 2 , where M = ni\+m 2 is the total mass. They are 
related through the equations (cf. Fig. 11.5) 

ri = R — — r and r 2 = R 4 - (11.56) 

1 M M v ’ 


The momenta of particles 1 and 2 are Pi and P 2 , respectively. The center of 
mass momentum is P = p + p 2 . The relative momentum is p = (m\/M)p 2 — 
(m 2 /A/)pi. 

The total Hamiltonian for the system can be written 


H -=i + i +v ^- r '^m +P i +v ^= E '- 


(11.57) 


The Hamiltonian decomposes into a part that depends only on the center-of- 
mass coordinates, H cm — P 2 /2M = E cm , and a part that depends only on the 
relative coordinates, H = p 2 / 2p + V(r) = E, so H tot = H cm + H. Therefore, 
the center-of-mass motion evolves independently from the relative motion. 
From Hamilton’s equation, P = -dH tot /d R = 0, so the center-of-mass 
momentum, P = p, + p 2 , is a constant of the motion. 

We will assume that the scattering is elastic, so the total kinetic energy, 
pf/2mi+p%/2m 2 , is conserved. Because the potential, V(|r 2 — ri|), is 
sphericlly symmetric (it depends only on the distance, |r 2 — rj |), the total 
angular momentum vector, L = ri x p } + r 2 x p 2 , is a constant of the motion. 

Let us assume that initially the particles are far apart and free but approach 
one another with momenta, Pi and p 2 , in the lab frame. Once they approach 
close enough to feel the interaction, the scattering process begins and their 
momenta will be altered. After the collision process is finished, they fly apart 


ZL 



Fig. 11.5. Relation between lab frame coordi- 
nates and center-of-mass and relative coordinates. 
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with final momenta, pj and p' 2 . Conservation of momentum and of kinetic 
energy (for elastic processes) then gives 

P cm = Pi + P 2 = Pi + P2 

and 

2 2 f2 f 2 

p\ , pj = Pi | Pi 
2m i 2 m 2 2m\ 2 m 2 ’ 

respectively. One can combine Eqs. (1 1.58) and (1 1.59) to show that for elastic 
collisions, the magnitude of the relative velocity, |v 2 - \\ |, is unchanged by the 
collision. The relative velocities before and after the collision are g = V 2 — Vi 
and g' = v 2 — v'j , respectively. Then for elastic collisions 


(11.58) 

(11.59) 


* = |g| = |g'l, (11.60) 

where g is the magnitude of the relative velocity before and after the collision. 

Because angular momentum is conserved, the scattering process occurs in a 
plane perpendicular to the angular momentum vector. We can choose the z axis 
to lie along the angular momentum vector. Then, the scattering process takes 
place in the x-y plane. If we introduce polar coordinates x = r cos (f) and 
y = r sin (0), the relative Hamiltonian can be written 

H = p 2 /2p+V{r) = ~[p 

where p r = pr and p^ = pr 2 <p. Hamilton’s equation take the form 

dpr. = _<M = tf L _dV 

dt dr pr 3 dr ’ 

dp<t> _ dH __ 
dt df 

Since dp^/dt = 0, the magnitude of angular momentum, p^, is a constant of the 
motion. We therefore set p<p = /, where / is a constant. Since df/dt = 1/ pr 2 , we 
can write 


dr _ dH _ p r 
dt dp r p ’ 

d(p _ dH _ Pfj, 
dt dp^ pr 2 


(11.62) 



(11.61) 


,, l fj l dr (l/r*)dr 

9 pr 2 r 2 p r yj2 p{E - V(r)) - P/r 2 ' 


(11.63) 


Equation (1 1.63) relates the polar angle, <f>, to the distance between the particles 
at any time during the scattering process. 
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Let us now consider the scattering process in more detail. Let us assume that 
at time t = — oo, mass m\ is at rest at the origin and mass m 2 has coordinates 
x = +oo and y = b and is moving parallel to the x axis in the negative x direc- 
tion with a velocity \ 2 = \ 0 = — vox. The quantity b is the impact parameter. 
Thus, at time t = -oo we have (r t = 0, \\ = 0, r 2 = oox + by, \ 2 = -vox) 
(cf. Fig. 11.6a). The center-of-mass velocity is \ cm = m 2 \ 0 /M = p\o/m\ and 
the relative velocity is v = — vox. The total angular momentum is L = ri x pj-f- 
r 2 x p 2 = —m 2 vobz. The angular momentum can also be divided into center-of- 
mass angular momentum, L cm = MR x \ cm , and relative angular momentum, 
L re i = pr x v = pbv oz, so that L = L cm + L re /. Since the magnitude of the 
relative velocity, g = v o, is unchanged by the collision, the impact parameter, b, 
is the same before and after the collision. Therefore, g and b are intrinsic 
parameters of the elastic collision processes. 

At the end of the scattering event, the particles move apart with constant 
relative velocity. At time t = — oo we have (f> = 0. At time t = +oo we have 
(f) = 0/- The angle, 0, between the incident relative velocity and the final 
relative velocity is called the scattering angle and it satisfies the relation 
© = 7r — (cf. Fig. 1 1 .6). 

It is useful to view the scattering process from the center-of-mass frame of 
reference, which is the reference frame in which the origin of coordinates is 
fixed to the center of mass and moves with it. In the center-of-mass frame, the 
displacement of m\ is ri )C = —m 2 r/M and the displacement of m 2 is 
r 2c = mjr/M. Similarly, their velocities are Vi )C = —m 2 y/M and 
y 2c = m\\/M, respectively. The relation between the final velocities, v^ and 
v 2c = mig'/M, of mass m 2 in the lab frame and center-of-mass frames, 
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Fig. 11.7. Relation between lab frame 
velocity, V2, and center-of-mass frame 
velocity, \' 2c , of mass m2. 


respectively, is shown in Fig. 1 1.7. From this figure we can relate the scattering 
angles, 9 and 0, as seen from the lab and center-of-mass frames, respectively. 
Equating components of the vectors in Figure 11.7, we find 

v 2,c sin ( 0 ) = v 2 sin W> 
and 

V2 )C cos(0) -f v cm = v' 2 cos (9). 

We can combine these equations to obtain 

£ + cos(0) 

COS (9) = — • • ■■ ■ =. 

yj 1 4- £ 2 4- 2£ COS (0) 

where £ = V cm /v' 2c • The magnitude of the center-of-mass velocity is 
V cm = ftvo/mi. Also, for elastic collisions we have v 2c = m.\Vo/M and 
£ = m 2 /m\. It is useful now to introduce the concept of scattering cross section. 

ii.C.2.2. The Scattering Cross Section 

The scattering problem may be viewed in the lab frame as that of a beam of 
particles of mass, m 2 , incident on a mass, mi, which is at rest. Let I be the 
intensity of the incident beam (number of incident particles/time-area). 
Consider a circular area element of the incident beam consisting of those 
particles with impact parameter in the interval b — ► b + db and azimuthal angle 
element da (cf. Fig. 11.8). These particles are scattered into the solid angle 
dcu = sin (9)d9da in the lab frame and into the solid angle dCl = sin (Q)dOda 
in the center-of-mass frame. The angle, a, is the same in the two frames. The 
number of particles/time, dN, scattered into the solid angle du = sin (9)d9da in 
the lab frame and into the solid angle dQ = sin (®)d@da in the center-of-mass 
frame is given by (a is the azimuthal angle) 

dN = Ibdbda = I(TiabSin(9)d9da = Ia cm sin(@)dQda, (1 1.67) 

where ai a b = 0 iab(g,b) and a cm = er cm (g, b) are the differential scattering cross 
setions in the lab and center-of-mass frames, respectively. Equation (11.67) 


(11.64) 

(11.65) 

( 11 . 66 ) 
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da 



Fig. 11.8. Relation between the impact parameter and the scattering angle in the lab 
frame. 


gives the relation between the impact parameter, b, and either the lab frame 
differential cross section, ai ab , or the center-of-mass differential cross section, 

®cm- 


bdb bdb 

sin (0)d6 a Gcm | sin(0)d© | ’ 


( 11 . 68 ) 


The differential scattering cross section is independent of a because of the 
spherical symmetry of the potential. As b increases, 6 and 0 decrease. The 
absolute value signs in Eq. (11.68) are necessary for positive cross sections. 
The total scattering cross section, cr tot , is given by 


&tot 


sin d0 


*27t r n r 2 tv 

da<j cm (b,g) = sin (6)d6 \ daa [ab (b,g). (11.69) 
o Jo Jo 


The scattering cross section can be thought of as the effective area cutout of the 
incident beam by the scattering process. This is clear in Exercise 11.1, where 
we compute the cross section for an incident beam of hard-sphere particles 
colliding with a fixed hard sphere. 

From Eqs. (11.66) and (11.68), the relation between the differential 
scattering cross sections in the center of mass and the lab frames is given by 


~t~ £ 2 4- 2 £cos(0)] 3//2 
(1 -j- £cos (©)) 


(11.70) 


In Fig. 11.9 we picture the scattering process in the center-of-mass frame of 
reference. In the center-of-mass frame, the scattering event, (p t , p 2 ) — » ► (p'j , p^), 
and the reverse scattering event, (p'^p^) — > (pi,p 2 ), are identical except that 
the directions of the particles are reversed. Therefore, cr cm (b,g ) is the 
differential scattering cross section for both the forward and the reverse 
scattering events in the center-of-mass frame. 
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Fig. 11.9. Scattering process as viewed from the center-of-mass frame. Here 
Vi )C = - m 2 g/M , v' lc = -m 2 g/M, \ 2 , c = m x g/M, and = m x g'/M. 


■ EXERCISE 11.1. Compute the total scattering cross section for elastic 
I scattering of a point particle of mass m off of a hard sphere of radius R which 
is fixed in position. Assume that the incident particle has velocity Vo and 
impact parameter b. 

Answer: The fact that the hard sphere is fixed in position means that it, in 
effect, has an infinite mass and Vi = 0. If we let m 2 = m and m\ = oo, we 
find V cm = 0. Therefore, £ = 0 and the scattering angles in the center-of- 
mass and lab frames are equal, cos(0) = cos(@). Also the collision cross 
sections in the center-of-mass and lab frames are equal, a i a b — &cm- 
The collision process is shown in the accompanying figure. 



Let (f> o be the angle of closest approach. Then the scattering angle is given 
by O = 7 T- 20o- The impact parameter is b = R sin(0o) = /?cos(©/2) so 
that, db = -(i?/2)sin(©/2)cf© and bdb = -(/? 2 /2)sin(©/2)cos(©/2)d© = 
~(R 2 /4)sin(S)d@. Therefore 

1 

! <?cm(b,g) = 


b db 


sin(©) d@ 


R^ 

~4 


( 1 ) 
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For this special case, the differential scattering cross section is independent 
of angle. The scattering is isotropic. The total cross section, cr tot , is given by 

°tot= f (T cm (b, g)2n sin (Q)d@ = ttR 2 . (2) 

Jo 

Thus, the total cross section is just the area of the fixed scatterer as seen by 
the incoming beam. 


■ EXERCISE 11.2. Compute the differential scattering cross section, er cm , 
for two particles of mass m\ and m 2 which interact via a repulsive Coulomb 
force with potential V(r ) = where k is a positive coupling constant. 
Assume that the incident particle has velocity vq and impact parameter b. 


Answer: For this scattering process, the angular momentum is a constant of 
the motion. Assume that the scattering process occurs in x-y plane. The 
Hamiltonian for relative motion, in polar coordinates, is 

- 2 / 2 




K _ 

+ ~-E, 


( 1 ) 


where p = m\m 2 / (mi +■ m 2 ) is the reduced mass, p r is the radial component 
of momentum, and = l is the magnitude of the angular momentum. We 
will assume that m\ is initially at rest, and m 2 is incident from infinity with 
velocity vo = — vox and impact parameter b. The magnitude of p 2 r can be 
written 


2 _ 9 7- 2 ■/*« I 2 _ 2 ^£ 


r 2 - 


kt 


2 pE 


(2) 


The particles must climb a potential hill. The greater the initial kinetic 
energy, the closer they can get before the repulsive force causes them to 
move apart. The distance of closest approach, ro, is the value of r for which 
p r — 0. This is al so called the turning point. From Eq. (2) we find 
r 0 = («/2E)(l + V 1 + 2 1 2 E/plk). 

The scattering process is shown in the accompanying figure. 
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The scattering process is symmetric about the point of closest approach. Let 
us make use of the relation, dcj) = ldr/p r r 1 and integrate the relative distance 
from r = oo to r = tq. Then we find 


<t> 0 


rr o 


d<f> = 


dr 


ry/2pEr 2 — Ifinr — l 2 


= sin 1 


7 T 
2 ’ 


(3) 


where e = y/\ + 2l 2 E/ fm 2 . The scattering angle, 0, is related to the angle 
0o through the relation © + 20o = tt- From E q. (3 ) we find l /e = sin (0/2). 
This in turn leads to cot(©/2) = Ve 2 — l = y/2EP } /m 2 . However, the 
relative angular momentum is / = pvo b = bsJlfiE , so that cot(©/2) = 
2 Eb/n. Thus, we find 


K /© 

b — — cot — 
2 E \2 


(4) 


Equation (4) gives us a relation between the scattering angle, 0, and the 
impact parameter, b, from which we can compute the differential scattering 
cross section. We find 

_ I | _ 1 / k \ 2 1 

0cm ~ |sin(0)de| _ 4 \2E> sin 4 (0/2) ' 1 ’ 

Equation (5) is the Rutherford formula for repulsive scattering between two 
electrically charged particles. The Coulomb force is too long-ranged for the 
arguments leading to the derivation of the Boltzmann equation to apply. An 
interaction potential is considered short-ranged if it falls off faster than 1 / r 2 . 


11.C.2. Derivation of the Boltzmann Equation 

Let us denote the rate of particles scattered out of AVi by (df- / dt) Ardp v The 
number of particles in AVi at time t with coordinates Pi — ► Pi + <^Pi and 
r r -t- Ar is/(p l5 r, ^Arcfpj. All particles of momentum p 2 lying within a 
cylinder of volume dq 2 = 2'nbdbgdt collide with particles pj in time dt. The 
number of such particles is /(p 2 , r, i)2 / Kbdbgdtdp 2 - The total number of 
collisions, A(p t p 2 — > pjp 2 ), per unit volume in time dt between particles of 
momentum pj and particles of momentum p 2 resulting in new momenta p' t and 
p 2 is given by 

W(PiP 2 -*• piPs) = 27 rgb db dt f{p 2 ,r,t)f{p l ,r,t)dp l dp 2 . (11.71) 

In Eq. (11.71) we have assumed that the distribution functions do not change 
appreciably in position space for the volume element AVi we are considering. 
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Also, we have assumed that the particles p } and p 2 are completely uncorrelated. 
This assumption is called molecular chaos , or Stosszahl-Ansatz . 

In analogy to Eq. (11.71) we may write for the inverse scattering process 

N(p 1>P2 Pi > P 2 ) = 'Z'Kgb db dt /(Pi, r, t)/(p 2 , r, t)dp\dp’ 2 . (11.72) 


For elastic collisions, dp x dp 2 = dp\dp 2 . (This is easily proved if we note that 
dp\dp 2 = dPdp and dp[dp 2 = dP'dp'. Furthermore, for elastic collisions, 
dP = dP' and p 2 dp = p' 2 dp f . Therefore, dp x dp 2 = dp\dp 2 .) We may now 
combine Eqs. (11.67), (11.71) and (11.72) to obtain the following expression 
for the net increase, {df\/dt) coX] dp x , in number of particles with momentum 
Pi Pi + dp { per unit volume per unit time: 



dpi 

coll 


= dp x 



dClgcr cm (b,g) 


x ( /( Pi , r, t) /( p 2 , r, t) - /( Pl , r , t ) /( p 2 , r , t ) ) , 


(11.73) 


where dQ = sin (Q)dQda, and cr cm is the center-of-mass collision cross section. 
If we now combine Eqs. (11.55) and (11.73), we obtain 


df\ . <9/i f, 

¥ +q, 'a? - r P2 


d SlgOcm {b, g) ( fy f 2 '-flf 2 ), (11 -74) 


where /j =/(p, , r, t). Equation (1 1.74) is the Boltzmann equation. Note that we 
have written the second term with df\ / dr rather than df\/d({ x (cf. Eq. 1 1.55) to 
indicate that Eq. (11.74) is valid for long wavelength disturbances. Since 
/(P> <1> 0 = (N/V)Fi(p, q, t ), the Boltzmann equation has the same form as the 
exact kinetic equation (6.75) except that the right-hand side is expressed 
entirely in terms of the one-particle distribution function, /( p,-,r, f). The 
Boltzmann equation is a nonlinear integrodifferential equation for/(p ; , r, t). 


11.C.3. Boltzmann’s H Theorem [4] 

Boltzmann’s equation describes the time evolution of the distribution of 
particles in six-dimensional phase space for a dilute gas with inhomogeneities 
which are slowly varying in position space. If no external fields drive the 
system, it should decay to equilibrium after a long time. Boltzmann’s equation 
describes such behavior. To show this, Boltzmann introduced a function H ( t ) 
which he defined as 

H{t) = J | dq Y dp! /( pj , qi , f)hi /( Pl , q, , t). (11 .75) 

He then showed that if /( p t ,qj,t) satisfies the Boltzmann equation, H(t) always 
decreases because of the effect of collisions. The proof goes as follows. We first 
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take the derivative of H(t ): 


— = rfpt^qt^rtin/i + i] 


(11.76) 


and then substitute Eq. (11.74) into Eq. (11.76). This gives 

?T= - f \dVudqi(<h ~V n /i + *1 


* f f f 

dqj j dp! dp 2 j dClga cm (b , g){f v f 2 > -f\ / 2 >[ln/i + 1]. 


(11.77) 


We next change the first term on the right-hand side into a surface integral and 
assume that for large pj and q, we have/(p, , q x ,t) —*■ 0. With this assumption 
the first term on the right-hand side gives no contribution and we have 

= dq y jdpi |afp 2 \dSlgcr cm (b,g)(f v f 2 > -/i/ 2 )[ln/i + 1]. (11.78) 

We now can rewrite Eq. (11.78). If we interchange coordinates p, and p 2 we 
obtain 


dq ! dpjjdpj dQg<j cm (b,g)(f v f 2 > -f\f 2 )[\nf 2 + 1]. (11.79) 

We next add Eqs. (11.78) and (11.79) and divide by two to obtain 

~ = i jdq, | dp, |dp 2 jdQ«a cm (fc,g)(/,./ 2 . -/,/ 2 )[ln/, +ln/ 2 + 2). 

(11.80) 

As a final step, we make a change of dummy variables pj <-> p j and p 2 *-*■ p 2 in 
Eq. (11.80), add the result to Eq. (11.80) and divide by two. If we remember 
that dp { dp 2 = dp\dp' 2 , we find 

~ = ^dq x jdp! |dp 2 dng<r cm (b,g)(f v f 2 >-f l f 2 )ln~~ ^0. (11.81) 

Equation (11.81) is always less than or equal to zero since a function of the 
form (y - jc)ln x/y is always less than or equal to zero. 

The derivative dH/dt will be zero only if fv fo =f\ f 2 for all collisions. This 
is the condition of detailed balance and is the equilibrium condition for the gas. 
It can also be written in the form 


ln/i + ln/ 2 = In f v + ln/ 2 /. 


(11.82) 
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At equilibrium, the single particle distribution must be independent of absolute 
position in the gas. Thus, Eq. (11.82) is a condition which must be satisfied by 
some function of the momentum of particles before and after collisions. The 
only microscopic quantities that are conserved in a collision are the total 
momentum, the kinetic energy, and a constant (particle number). Therefore, in 
equilibrium In f\ must be of the form 


ln/i — A + B • pj + C ~- 
2m 


(11.83) 


and/i reduces to 


/l(Pl) ~ e A+B •P 1 + C Pi/2m j 

where A, B, and C are constant. This is the general form of the Maxwell- 
Boltzmann distribution for a gas whose average momentum is not necessarily 
zero, and it is a stationary solution of the Boltzmann equation. 

Since H(t) always decreases with time, the negative of //(f) will always 
increase with time. Boltzmann identified the following quantity as the 
nonequilibrium entropy S(t): 


S(t ) = -k B H(t) = -k B 


dqi rfPi /( Pi , qi , t) In /( Pl , , f) . ( 1 1 .84) 


This entropy differs from the Gibbs entropy in that it depends only on a reduced 
distribution function and not on the full distribution function as does the Gibbs 
entropy (cf. Section 7.A). 


ll.D. LINEARIZED BOLTZMANN AND LORENTZ- 
BOLTZMANN EQUATIONS 

We wish to obtain microscopic expressions for the coefficient of self-diffusion, 
D, thermal conductivity, K, and shear viscosity, 77 for a two-component system 
whose distribution function varies in time and space according to the 
Boltzmann equation. 


ll.D.l. Kinetic Equations for a Two-Component Gas 

Let us consider a dilute gas of identical particles and place a radioactive 
tracer on half of the particles but not change their properties in any other way. 
Then the dynamics of the two kinds of particles and the collision cross sections 
will be identical. The Boltzmann equation for the normal particles can be 
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written 


dfi,N , . dfi y N 

~W + q '~dT 


J P2 I dClg(j{b,g) a0 . Nl {f V)a f2' ) l3 ~f\, N h n ), 

q,/?,7 ^ ^ 

(11.85) 


and for the tracer particles it can be written 

+ f Jp 2 \ d ^(b,g) ap . T ^fv,af2\/3 ~ fl,T fl n ), 

a ,/?, 7 ^ ^ 

(11.86) 


where the summations over a, (3, and 7 are over normal and tracer particles and 
we have used the notations fi yN = r, t) and f\j = /r(Pi , r, t). The number 
of tracer and normal particles is conserved during collisions. Thus, only the 
cross sections ctnn-.nn = &nt-.nt = cftn-.tn = vtt.tt = cr arc nonzero. Note that 
these are center-of-mass cross sections. 

When we compute transport coefficients, it is sufficient to consider systems 
with small-amplitude disturbance from equilibrium. We then expand the normal 
and tracer particle distributions about absolute equilibrium. If we denote the 
normal particle equilibrium distribution by /^(Pi) and the tracer particle 
equilibrium distribution by /^(Pi), we have 

/"(Pi) =/"(P.) =/° 7 -(Pl) = J (^) 3/2 ^ /2 "- ( U - 87 ) 


We have assumed that there are N/2 tracer and N/2 normal particles in the 
system and have let no = N/V. 

As a first step in linearizing the Boltzmann equation, we can write 

/v(Pi,r,r) =/°(Pi)[l +MPi> r >01 (11.88) 

and 

/r(Pi,r,t) =/°(Pi)[l +MPi»r,01, (11.89) 


where /^(p^r, t) and h T {\> x ,r,t) denote small amplitude disturbances in the 
normal particle and tracer particle distributions, respectively. 

If we substitute Eqs. (11.88) and (11.89) into Eqs. (11.85) and (11.86), 
respectively, and neglect terms of second order or higher in h N ^ (p 1; r, t), we 
obtain 


( dh\^ 

V dt 


+ Qi 


^r) = S j'to |<«lS^(*,?)^:« T / 0 (P2) 


x (hl',a -H h2',/3 — — h. 2 yl ) 


(11.90) 
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and 




E 


dp 2 


(mg<T(b,g) a/3:Tl f 0 ( p 2 ) 


x (/* 1 ', Q -f h2',/3 ~ hij — ^ 2 , 7 ). 


(11.91) 


In Eqs. (11.90) and (11.91), we have used kinetic energy conservation to write 
/°(Pi)/°(P 2 ) — /°(Pi)/°(P 2 )- Equations (11.90) and (11.91) are the linearized 
Boltzmann equations for the normal and tracer components, respectively. 

We can now decouple diffusion effects from viscous and thermal effects. Let 
us define the total distribution function as 


h + (p u r,t) =h N (p u r,t)+h T (p u r,t) (11.92) 

and the distribution function for the difference in normal and tracer distribution 
as 


h (p l5 r,t) = fyv(Pi,r,f) - h T {\> x ,r,t). (11.93) 

As long as h~ (p t , r, t ) is nonzero, diffusion will occur in the field. 

The equations of motion for the total distribution, A + (p 1 ,r, t), and for the 
difference distribution, /^(p^r, t), decouple. If we add Eqs. (11.90) and 
(11.91) we find 

^■p + qi lfr =2 \ dV2 g) f°(V2)( h v + h 2> ~ h \ ~ h i), 

(11.94) 

which is just the Boltzmann equation for the total particle distribution. If we 
subtract Eq. (11.91) from (11.90), we find 

+ - ^ = 2 \ d V2^dttgo{b,g) f°(v 2 )(hY ~ h \)- (H-95) 

Equation (11.95) is called the Lorentz-Boltzmann equation and is the kinetic 
equation describing diffusion. 


11.D.2. Collision Operators 

It is convenient to introduce the concept of a linearized collision operator. Let 
us consider Eq. (11.94). We write it as 


dht . dh\ 

■aT + q, "aF 


= C,+A+, 


(11.96) 
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where C x is the Boltzmann collision operator , which, when acting on some 
arbitrary function ^(pj), yields 

CiS(Pi) = ^dp 2 ^(mg(T{b,g)f 0 ($ 2 )(gv+g2> - 8 1 -gi)- (11.97) 


Let us now introduce a scalar product of two functions </>(pj) and x(Pi) : 


{<t>, X) = 


—) 

2nmJ 


3/2 


p,)x(p,). 


Using Eq. (11.98) it is easy to show that C + is self-adjoint: 

<$,C+x) = (C + $,x) 

(we leave it as a homework problem). 

If we write the eigenvalue equation for C + in the form 


(11.98) 


(11.99) 


C+*(p) = A* (p), 


(11.100) 


we can show there are five eigenfunctions of C + with eigenvalue zero. They are 
the five additive constants of the motion: 1, p, and p 2 /2m. All other eigenvalues 
of C+ are negative. We prove this by writing the expectation value of C+ in the 
form 


<$,££$) = - 


1 N 
4V 



dp x dp 2 


dne~ i(3/2m)ip2 i +pl 2 ) 


x ga(b,g)($v + $ 2 ' ~ $1 - $ 2 ) 2 - 


( 11 . 101 ) 


Thus, ($, Cp $) is less than or equal to zero for an arbitrary function $(p). It 
will be equal to zero only if ($v + $ 2 ' - $1 - $ 2 ) = 0 — that is, if $(p) is a 
linear combination of the five additive constants of motion 1, p, and p 2 f 2m. 
Therefore, C+ is a negative semidefinite operator with a nonpositive spectrum 
of eigenvalues. 

The linearized collision operator C+ has one other interesting property. It 
behaves as a scalar operator with respect to rotations in momentum space. That 
is, C+/i(p) transforms in the same way under rotation in momentum space as 
does /i(p). Therefore, eigenfunctions of Cp have the form 

^r,/,m(p) = R r ,l(p)Yl im ( P/|P|), (11.102) 


where T/,m(p/jp|) are spherical harmonics. 

Let us now consider a simple example. Let us solve the linearized 
Boltzmann equation (11.94) for the case of a spatially homogeneous system. 
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Equation (11.94) then becomes 


dh\ 

dt 


* f 

= CtK = 2 dp 2 daga(b t g)fl[h l ,+hr-h^-h 2 \, (11.103) 


where h\ = h + (p l ,t), and so on. If we assume that Ai"(Pi,0) = 
r ,i,m^r,i,m^r,i,m(Pi), we may write the solution in the form 

hi (Pi,*) = ^2e Xrlt A rM ^ rAm ( pj. (11.104) 

r,l,m 


The fact that the eigenvalues X r j must all be negative or zero means that 
h\ (p t , t) will decay to a time-independent quantity after long enough time and 
the system relaxes to equilibrium. 

The Lorentz-Boltzmann collision operator , Cf , appearing in the equation for 
the difference distribution has simpler structure than the Boltzmann collision 
operator. If we write 


^-+q.-^r = c r /.r. (n.105) 

the collision operator Cf, when acting on an arbitrary function 4>(p,) of p,, 
yields 


C7*( Pi) = 2 


d p 2 




(11.106) 


The Lorentz-Boltzman collision operator Cf differs from C/~ in that it has only 
one zero eigenvalue, a constant, while the Boltzmann collision operator, C/\ 
has a five fold degenerate zero eigenvalue. 

We can use the Boltzmann and Lorentz-Boltzmann equations to obtain 
microscopic expressions for the coefficients of self-diffusion, viscosity, and 
thermal conductivity. 


■ EXERCISE 11.3. Write (XiC + X) in a form that makes explicit the 
conservation of kinetic energy and momentum during the collision process. 

Answer: The relative velocity before the collision is g = V 2 — vi , and after 
the collision it is g' = where |g| = |g'| = g. Let us introduce a unit 

vector, e, which lies in the scattering plane and on the symmetry line 
between the initial and final directions of the relative velocity, as is shown in 
Fig. (a). 
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z 

A 

i 



ib) 


Using e, we can write 

| g' = g-2e(e-g) and g' ■ g = g 2 cos(0) = g 2 (l - 2cos 2 (</> 0 )), (1) 

I 

where the scattering angle, 0, satisfies the condition, 0 = 7 r — 2 (f> 0 . In Fig. 
(b) we draw the scattering plane in three-dimensional space, with the initial 
J relative velocity vector, g, lying along the z axis. If we note that 
dCt = d(cos(0))da:, we can write 

gdQ = f g'dg'dQMg' - g) = [ g'dg'dmg' - g) | dV c J(V' cm - V em ) 

= f|pV' m dg'S(E' - E)6(V' ca - Vo.), 

(2) 

where dg' = g' 2 dg'dft. We have used the fact that the total kinetic energy 
can be written 

E = y O'? + v l) = m ( V cm +5^ 2 ) = m (V?m +W 2 )- (3) 

I ^ 

Note that 6(0 - E)6(\' cm - V em ) = 6((m/4)(g* - g 2 ))^ - Vo.) and 
S((m/4)(g' 2 — g 2 )) = — g). We can also write Eq. (2) as 

gdQ = ^ dp[dp' 2 6(p? + pi - p\ - p\)8{p\ + P 2 - Pi - P 2 ) , (4) 

where we have used the fact that m 6 dV' cm dg! = dp[dp f 2 and 8{E' — E ) 

I 5 ( v cm - Vcm) = 16 m 4 6(p\ +pl -pf -p%)6 ( Pi + p 2 - p') - P 2 ). 
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We can now use the above results to write the Boltzmann collision operator 
in the form 


C£*(Pi) =l~ J <*P2 j d?\ j dp' 2 a(g, Q)8(p\ +p\- pf - p%) 

x + P2 - Pi - P , 2 )^ _/3 ^ /2 w (x(p'i) + x(P 2 ) - x(Pi) ~ X(P 2 ))- 

(5) 


In Eq. (5), we have written the scattering cross section as a function of 
scatering angle, 0, rather than impact parameter, b. Using Eq. (5), we can 
write (x, C + x) in a form which shows explicitly the conservation of 
momentum and energy. We find 


(x, C + x) = - 


2 N 
mV 



dp\ 




dp' 2 v{g, 6 ) 


x &(P 2 \ +Pl~ p'\ - P 2 ) 6 ( Pi + P 2 - Pi - P 2 ) 
x e -mmM+p\) {x{ pj) + x(p ') _ x(Pl) _ X ( P2 )) 2 . 


( 6 ) 


11.E. COEFFICIENT OF SELF-DIFFUSION 

We shall begin with a derivation of the coefficient of self-diffusion because it is 
the easiest to obtain. The method we use is due to Resibois [10] and consists of 
two steps. In the first step, we derive the linearized hydrodynamic equation 
from the Lorentz-Boltzman equation and introduce the self-diffusion 
coefficient into the hydrodynamic equation using Fick’s law. We then can find 
the dispersion relation for the hydrodynamic diffusion mode. In the second step 
we use Rayleigh-Schrodinger perturbation theory to obtain the hydrodynamic 
eigenvalues of the Lorentz-Boltzmann equation. We then match the eigenvalue 
of the hydrodynamic equation to that of the Lorentz-Boltzmann equation and 
thereby obtain a microscopic expression for the self-diffusion coefficient. 


ll.E.l. Derivation of the Diffusion Equation 

The difference in tracer and normal particle densities at some point r is given by 


m(r, t ) - n N ( r, t) - n T ( r, t) = 


dPif°(Vi)h (p,,r, t) 


(11.107) 


(at equilibrium this difference is zero). If we multiply the Lorentz-Boltzmann 




COEFFICIENT OF SELF-DIFFUSION 


689 


equation by/°(p 1 ) and integrate over p b we obtain 

r\ 

- m(r , 0 + V r • J D (r , t) = 0, (11-108) 

where J D (r, t) is the diffusion current and is defined microscopically as 

J D (r, t) = d Pi“/°(Pi)MPi> r >0- (11.109) 

The contribution from the collision term is identically zero because Cp ^ 1=0. 
We now introduce the self-diffusion coefficient using Fick’s law: 

J D (r, t) = — DV r m(r, t). (11.110) 

If we combine Eqs. (11.108) and (11.110), we obtain the following 
hydrodynamic equation for the self-diffusion process: 

-m(r,<)=DVjm(r,0- (11.111) 

To find the dispersion relation for hydrodynamic modes we define the Fourier 
transform 

m(r,t) = — |[dk dum(k, u)e l ^ k ' T ~ wt \ (11.112) 

(2 7r) J - 

which allows us to study each Fourier component of the diffusion equation 
separately. If we substitute Eq. (11.112) into Eq. (11.111) we obtain 

— ium(k, u) + Dk 2 m(k,u) = 0 (11.113) 

(different Fourier components do not couple, because the hydrodynamic 

equation is linear). From Eq. (11.113) we obtain the following dispersion 
relation for the self-diffusion mode: 

uj=-iDk 2 . (11.114) 

The diffusion frequency is imaginary, which means that the contribution to the 
density m(r, t) with wavevector k dies out in a time which depends on the 
diffusion coefficient and the wavevector k : 

m{ r,f)~e ikr <r D ‘ 2 '. (11.115) 

Thus, very-long-wavelength disturbances take a long time to decay away. This 
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is the characteristic behavior of a hydrodynamic mode. Since the identity of the 
particles is preserved in each collision, the only way to cause difference in the 
density of normal and tracer particles to disappear is to physically transport 
particles from one part of the fluid to another. For very-long-wavelength 
disturbances the equalization takes a long time since the particles must be 
transported over long distances. 

11.E.2. Eigenfrequencies of the Lorentz-Boltzmann Equation 

We can obtain the hydrodynamic eigenfrequencies of the Lorentz-Boltzmann 
equation in terms of a perturbation expansion in powers of the wavevector k by 
using the Rayleigh-Schrodinger perturbation theory [11]. After this has been 
done we can equate the eigenfrequency of the hydrodynamic equation to the 
hydrodynamic eigenfrequency of the Lorentz-Boltzmann equation and thereby 
obtain a microscopic expression for the coefficient of self-diffusion. 

Since Eq. (11.105) is a linear equation for A - (p x , r, f), we can consider each 
Fourier component separately and write 

h~{p,r,t) = |tf n (p,k))_e ,k ^r ,w "'; (11.116) 

we obtain the following eigenvalue equation for |^„(p,k))_ : 

(c;-to.£)|$„(p,k)). = -,a-„-|$„(p,k)>. (11.117) 

where k = k/|k| and the eigenfunctions, |\]>„(q, k))_, are assumed to be 
orthonormal. For long-wavelength hydrodynamic disturbances, k will be a 
small parameter. Thus, we can use Rayleigh-Schrodinger perturbation theory to 
obtain a perturbation expansion for u n in powers of k. 

Let us assume that both and |^„(q, k))_ can be expanded in powers of k. 
Then 


u n = + kdp + -| (11.118) 

and 

I*,). = |*i 0) }_ + fcl*' 11 )- + it 2 1®< 2) >. + • • • . (11.119) 

If we substitute Eqs. (11.118) and (11.119) into Eq. (11.117), we obtain the 
following expression for frequencies and 

JO) = i _ < *(°)|C£|#(®>)_, 


(11.120) 

(11.121) 
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and 



(11.122) 

where the matrix elements are defined as in Eq (11.98) and At 

this point we shall restrict our attention to the eigenvalue which reduces to zero 
when k — » 0. This corresponds to the hydrodynamic mode. There will be only 
one such eigenvalue of C~ since there is only one zero eigenfunction of C ~ , 
namely a constant. 

J (O') A 

We will let |^j )_ denote the zero eigenfunction of C~ and we will normal- 
ize it using the scalar product in Eq. (11.98). Thus, 


i*l 0) >- = i 

and we find 


LJ 

U 


(!) = 1 




■ p, 


— o, 


and 


u 


( 2 ) 




dp { e 0p V 2m K - 


1 



K Pi 


(11.123) 

(11.124) 

(11.125) 

(11.126) 


The hydrodynamic eigenfrequency, u>\, has the correct hydrodynamic 
behavior. If we now equate Eqs. (11.114) and (11.118) and use Eqs. 
(11. 124)— (1 1. 126), we find 


D = j 

m z 




Thus, we have obtained a microscopic expression for the self-diffusion 
coefficient, D. In Section ll.G we will discuss how to evaluate the integral in 
Eq. (11.127). 


ll.F. COEFFICIENTS OF VISCOSITY AND THERMAL 
CONDUCTIVITY 


The Boltzmann collision operator, Cp"\ has five zero eigenvalues (it is fivefold 
degenerate) and therefore we can derive from the linearized Boltzmann 
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equation five linearized hydrodynamic equations: one equation for the total 
particle density, n(r, t) = n N (r, t) +nr(r,t); three equations for the three 
components of the momentum density, mn(r,t)v(r,t), where v(r, t) is the 
average velocity; and one equation for the average internal energy density. To 
find the hydrodynamic eigenfrequencies (which will be expressed in terms of 
the coefficients of viscosity and thermal conductivity), we must find the normal 
modes of the system of five hydrodynamic equations. We then use the 
Rayleigh-Schrodinger perturbation theory to find the five microscopic 
hydrodynamic frequencies of the Boltzmann equation. Once they are found 
we can match them to the five normal mode frequencies of the hydrodynamic 
equations and thereby obtain microscopic expressions for the coeffcients of 
viscosity and thermal conductivity. 


11.F.1. Derivation of the Hydrodynamic Equations [12] 

The average particle density is defined by 


n( r, t ) 


frfp/°(p)[l +/i + (p,r,t)]. 


(11.128) 


If we multiply Eq. (11.94) by /°(Pi), integrate over p b and make use of the fact 
that Cp +) 1 = 0, we obtain 


|n(r,t) + V r -J"(r,r) = 0, 

where J n (r, t) is the average particle current and is defined as 


J"M - 


dp~f°{p)h + (p,r, t ). 
m 


(11.129) 


(11.130) 


It is useful to decompose J"(r, t) into the product J”(r, t) — n{ n, f)v(r, t ). In 
the linear regime, J"(r, f)«n 0 v(r,(), where no is the equilibrium particle 
density. Eq. (11.129) then becomes 


Q 

— n(r,t) + noV r • v(r,f) = 0. 
ot 


(11.131) 


Equation (11.131) is the linearized continuity equation and describes the 
conservation of total particle number. The continuity equation is entirely 
reactive. If we reverse time in Eq. (11.131), we do not change the form of the 
equation. Thus, the continuity equation contains no irreversible effects. These 
must come from the other hydrodynamic equations. 
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If we multiply Eq. (11.94) by p, /°(Pi), integrate over p b and use the fact 
that Cp r) p = 0, we obtain 

m|j"(r,l) = -V r -PM, (11-132) 

where P(r, t) is the pressure tensor and is defined 

P{r,t)=~ dp/°(p)pp/i + (p,r,f) (11.133) 

(we have let q = p/m). The pressure tensor describes the momentum flux, or 
current, in the system and contains an irreversible part due to viscous effects. 

To obtain the equation for the internal energy density, we must take the 
average of the thermal kinetic energy (l/2m)(p - mv(r, t)) 2 . However, in the 
linear approximation, v(r,t) does not contribute. Thus, to find the hydro- 
dynamic equation for the internal energy density, we can multiply Eq. (11.96) 
by /?i /2m and integrate over p!. If we use the fact that C^p 1 = 0, we obtain 

|«(r,() = -V r -J“(r,r), (11.134) 

where u(r , t) is the internal energy per unit volume, 

w(r,t) = dp^/°(p)/z + (p,r,t), (11.135) 

and J“(r, t) is the internal energy current, 

J“(r,t)= f°{p)h + {p,r,t). (11.136) 

J zm m 

We can write the internal energy density in the form u(r,t) = n(r, f)e(r, t), 
where e(r, t) is the internal energy per particle. In the linear approximation we 
find w(r, t) = noe(r, t ) + e 0 n(r, t ), and Eq. (1 1.134) takes the form 

n 0 1 e(r, t) + e 0 ~ n(r, r) = - V r ■ J“(r, t), (1 1 . 1 37) 

where e 0 is the equilibrium internal energy per particle. The current J“(r, t) will 
contain an irreversible part due to thermal conduction. 

Equations (11.131), (11.132), and (11.137) are the linearized equations of 
motion for the average particle density, n(r, t), the momentum density, 
mj"(r, t ), and the internal energy density, w(r, t). They govern the variation in 
time and space of these quantities. However, they cannot yet be solved because 
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they contain too many unknowns. As a next step we must write expressions 
for the pressure tensor, P(r,t), and the energy current, J“(r, t). The form 
that these two quantities must take was already discussed in Section 10.B. 
From Eqs. (10.7), (10.30), and (10.31), we can write the pressure tensor in the 
form 


Pij{r, 0 = t)6ij - C<5«/(V r ' v ) “ ■ v) , (1 1.138) 

where P{ r, t) is the hydrostatic pressure, £ is the coefficient of bulk viscosity, 
and 77 is the coefficient of shear viscosity. If we substitute Eq. (11.138) into Eq. 
(11.132) we obtain, after some rearrangement, 

mn° — v(r,t) = -X7 r P(r,t) + r)V 2 r \{r,t) + (C + 5 *?)V r (V r • v(r,t)). 

(11.139) 

The contribution from the hydrostatic pressure term is reactive [it has the same 
dependence under time reversal as the left-hand side of Eq. (1 1.139)], whereas 
the viscous contribution is dissipative (it changes sign relative to the left-hand 
side). All terms in Eq. (11.139) are linear in deviations from equilibrium and 
depend only on first-order gradients. Thus, we have assumed that all spatial 
variations have very long wavelengths and higher-order gradients can be 
neglected. 

Let us now consider the energy equation, Eq. (11.137). The energy current 
will have a contribution from the convection of internal energy, a contribution 
involving the pressure which comes from work done in compression or 
expansion of regions in the fluid, and a contribution from heat conduction [cf. 
Eq. (10.25)]. Thus in the linear approximation we can write 

J“(r, t) = n 0 e 0 \{r,t) + P 0 v{r,t) - KV r T{r,t), (11.140) 

where Pq is the equilibrium pressure, K is the coefficient of thermal 
conductivity, and 7(r, t) is the temperature. If we substitute Eq. (11.140) into 
Eq. (11.137) and make use of Eq. (11.131), we find 

no|e(r,0 = -/> 0 V r • v(r,r) + *rV r 2 r(r,r). (11.141) 

Equation (11.141) simplifies when we write it in terms of the local entropy. If 
we use the thermodynamic relation de = Tds -f ( P/n 2 )dn , we can write 



COEFFICIENTS OF VISCOSITY AND THERMAL CONDUCTIVITY 


695 


If we combine Eqs. (11.141) and (11. 142) and again use Eq. (1 1 . 1 3 1 ), we obtain 


r\ 

no Toj t s(T,t) = KVlT^t) 


(11.143) 


where Tq is the equilibrium temperature and s(r, t) is the local entropy per 
particle. 

The hydrodynamic Eqs. (11.131), (11.139), and (11.143), which have been 
derived from the Boltzmann equation with the help of thermodynamic relations, 
are identical to Eqs. (10.36)-(10.38). If we choose the density, n(r, f), and the 
temperature, T(r, t), to be the independent variables, the hydrodynamic 
equations take the form 

dn 

— + n 0 V r • v = 0, (11.144) 



(11.145) 


and 


mno 



°dn 

— + mno 


ds\°dT 

pTj p dt 


K 

To 


vjr. 


(11.146) 


Equations (11.1 44)— ( 11.146) are coupled equations for variations in average 
density, velocity, and temperature. We now must find normal mode solutions. 
Since the equations are linear, each Fourier component will propagate 
independently. Thus, we only need to consider a single Fourier component. 
We will write 


n(r,t) = n k {v)e lkT , 

(11.147) 

v(r, t) = v k (u;)e' k ' r , 

(11.148) 

T(r,t) = T k (cj)e ik ' T . 

(11.149) 


We will also separate the average velocity, v k , into a longitudinal part, 
v" = v"k, and a transverse part, v 1 (cf. Section 10.C). 

The hydrodynamic equation for the transverse velocity, which describes the 
propagation of shear waves, takes the form 

{-iuj 4- u,k 2 )y^(uj) = 0, 


(11.150) 
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where v t = ri/mno denotes the transverse kinetic viscosity. The dispersion 
relation for the transverse mode is given by Eq. (11.150). It is 

u=—ii/ t k 2 . (11.151) 

There are two shear modes with this frequency. Thus, we find that the shear 
waves behave very much like the diffusion waves in Section 1 1 .E. Any shear 
disturbance will be damped out. 

The longitudinal modes satisfy the equation 


— IUJ 


ICJ 

ikc\ 

7 Po 

cjctp 

Pol 


-ipok 

[iu -f uik 1 ] 


0 

ikc^ap 

7 


1 



= o, 


0 


^ [iu -f 7X& 2 ] J 


\f k (u)J 


(11.152) 


where Co = \J (dP/dmn) s is the speed of sound, a P = —(l/no)(dn/dT) p is the 
thermal expansivity, i — cp/cmn , v\ = (l/p 0 )(C + \ P) is the longitudinal 
kinetic viscosity and x = K/mn^cp (cf. Section 10.C). The dispersion relations 
for the longitudinal modes can be found by setting the determinant of the matrix 
in Eq. (11.151) to zero and solving fora;. To second order in wave vector k, we 
find 


uji = 


—i 


M 


nocp 


and 


UJ± = db C'ok — 


ik 2 
2mno 


4 (\ 

-p + C + mKi — 
3 \c v 



(11.153) 


(11.154) 


The first solution [Eq. (11.153)] is purely imaginary and the second two [Eq. 
(11.154)] are complex. (Small k means long- wavelength disturbances.) 

The wave of frequency u;i corresponds predominantly to an entropy (or heat) 
wave. It is damped out exponentially with time. 

The waves of frequency u>± (waves going in opposite directions) correspond 
to pressure or longitudinal velocity waves (sound waves). These waves 
propagate, but in the presence of transport processes they eventually get 
damped out. Thus, at long wavelengths there are two shear modes: one heat 
mode, and two sound modes in the fluid. The frequencies in Eqs. (11.151), 
(11.153), and (11.154) are the normal mode frequencies of the system. These 
five frequencies, together with the diffusion frequency, Eq. (11.114), give the 
six normal mode frequencies of this two-component, dilute gas of dynamically 
identical particles. 
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11.F.2. Eigenfrequencies of the Boltzmann Equation [10, 13] 

The second step in deriving microscopic expression for the thermal 
conductivity and shear viscosity is to obtain the hydrodynamic eigenfrequencies 
of the linerized Boltzmann equation. Just as in Section 1 1 .E we only need to 
consider one Fourier component of the linearized Boltzmann equation. If we let 

h + ( p, r, t) = |»„(p, k)) +e *' r e-""', (11.155) 

we obtain the following eigenvalue equation for the eigenvectors: 

(Cp - ikn • p/m)|^„ (p, k)) + = k)) + . (11.156) 

To obtain a perturbation expression for u n in powers of k, we must first look at 
the eigenvalue problem for the unperturbed operator C+. We will denote the 
eigenfunctions of C+ by \<f> n ). Then 

c;\4>„) = -iu£|</>„). (11.157) 

We know that C+ has five zero eigenvalues and, therefore, five degenerate 
eigenfunctions. Orthonormalized eigenfunctions with eigenvalue zero are given 

by 


\<h) = i, 

(11.158) 

II 

(11.159) 

ii 

(11.160) 

ft, 

II 

(11.161) 


and 

i 05 >= v!H + ^ p2 )- (iu62) 

We shall denote the five eigenfunctions collectively as |0 Q ), where 
a = 1,2, 3, 4, and 5. All other eigenfunctions will be denoted collectively as 
1 0/j ) where (3 = 6, . . . , oo. The eigenfunctions \<j> n ) are assumed orthonormal 
with respect to the scalar product in Eq. (11.98). 

Since the eigenfunctions | <f> a ) are degenerate, we must first find the proper 
linear combination of them to use for the zero-order approximation to the exact 
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eigenfunctions |\P a ) + . That is, we must find some |^^) + such that 

l*«>+ = l^ 0) >+ +<=|4'i 1) )+ +^l^ 2) > + + ■", (11.163) 

where 

l^ 0) >+ = Y, (11.164) 

a' 

The process of determining of the coefficients c aa > will also give us the first- 
order term, in the perturbation expansion of uj a : 

= <*4 °' } + ) + H . ( 1 1 . 165) 

To find the coefficients c QQ / and let us first insert the expansions in Eqs. 
(1 1.158)— (1 1.162) and (11.165) into Eq. (11.156) and equate coefficients of 
order k. We then obtain 

c; |4'< 1, ) + = +^r) i*i 0) > + - (n-166) 

If we multiply Eq. (11.166) by (0 a //|, we obtain 

*i 0) > + = ((f>a», (*c • p/m) V% ) ) + (11.167) 

(note that since Cp~^ is self-adjoint, (cf) a » | is the same as its dual vector |0 a "))> 
or, using Eq. (11.164), we obtain 

(^q ^ (-aa" — ^ ^ ^a^a'^aa' > (11.168) 

a' 


where 




K P 

m 

3/2 


(f) a i 


-(£) jdpe-^^(p)^Mp)- 


(11.169) 


If we use the expressions for j cf) a ) (a = 1, ... ,5) appearing in Eqs. (11.158)- 
(11.162) and assume that k lies along the x axis so that p ic = p x , we find by 
explicit calculation that 




(11.170) 
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^=^=(j) (i 


(11.171) 


For all other a" and a’, W a » a i = 0. 

We may obtain values for from Eq. (11.168). We first write it in matrix 
form and set the determinant of the matrix to zero: 

det(W-a; (1) l) = 0. (11.172) 

If we evaluate the above determinant, we obtain the following equation for a/ 1 ); 

(-Ji)f =0. (11.173) 


From Eq. (11.173) we obtain the following first-order corrections to uj a : 


,.o) = = 1 _L 

1 2 rnp 


(11.174) 


4° = = JV = °. (11.175) 

In Eq. (11.174), cq = (§ (1/m /?)) 1 / 2 is the velocity of sound of an ideal gas. 
Notice that we have lifted the degeneracy of only two of the states |^ a ) + . We 
have to go to higher orders in the perturbation expansion to lift the degeneracy 
in the rest of the states. 

Now that we have expressions fora^ and W aa >, we can use Eq. (11.168) and 
the condition that the states |\P^) + must be orthogonal to obtain expressions 
for Coca' and, therefore, for |$v) + . Substitution of Eqs. (11.170), (11.171), 
(11.174), and (11.175) into Eq. (11.168) gives the following results: c\j, = 
C U =£23 = C2A = c 32 = c 42 = ^52 = 0, C 15 - (l )^ 2 C\2 = (I ) X ^C\\, Cp = 

7 C 22 = (§) / c 2 i,c 35 = -(|) / 2 c 3 i,c 45 = -(I ) 17 c 4 i,andc 5 i = -(f ) 1 ^ 55 . 
The condition of orthonormality of j^^) + gives us C 12 = C 22 = {2)~ l , 
C55 = (f) 1/7 , and c 3 i = c 34 = c 4 j = c 43 = cs 3 = c 54 = 0. We therefore obtain 
the following expressions for the wave functions |^^) + : 

l*S 0) >+ =^5 [(j) ,/2 |^> + \<h) + (|) ,/Z |05>], (11.176) 

l« , 2 0) > + = ^ [®' / 2 |<fo> - \<h) + (|) 1/2 |05>i, (11.177) 


i«4 0) > + = m, 

l*f> + = I'M, 


(11.178) 

(11.179) 
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and 

l*f > + = + ® 1/2 I05>]. (11.180) 

We will use the state |^^) + as the basis states for the perturbation theory. The 
general expression for off has been given in Eq. (11.122). We thus find the 
following expressions for the five hydrodynamic frequencies: 

Ut = -co/c + <* 2 + (*| 0) [ (^ + co) i (£ + «o) |*S 0) ) + . (11.182) 

{u - m) 

u 4 = ik 2 ^ 0) \ , (11.184) 

+\ 4 I m C+ rn 4 /+ v ’ 

and 

a > 5 = ik 2 (V 0) . (11.185) 

+\ 5 m C+ m I 5 /+ v 

The frequencies u\ and uj 2 may be identified with sound modes, frequencies 
and u 4 may be identified with shear modes, and frequency u 5 may be identified 
with the heat mode. 


11.F.3. Shear Viscosity and Thermal Conductivity 

Given the microscopic expressions for the hydrodynamic frequencies in Eqs. 
(11.181 )— ( 11.185), we can match them to the frequencies obtained from the 
linearized, hydrodynamic equations and obtain microscopic expressions for the 
transport coefficients. If we equate Eqs. (11.151) and (11.183), we find for the 
shear viscosity 

'--w Gy 7 j < iu86) 


If we equate Eqs. (1 1.153) and (1 1.185), we obtain for the thermal conductivity 



Qp 2 /2m \ P^£ _ 5 ' 

[2m 2 


(11.187) 
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Note that the shear viscosity is expressed in the form of a momentum current 
correlation function and the thermal conductivity is expressed in the form of an 
enthalpy current correlation function. 

The expression of the transport coefficients in terms of correlation functions 
can be made still more explicit. Let us consider the viscosity. We can write Eq. 
(11.186) in the form 


’ ? = " 0/J (2^) 7 1 dp£ ' /,p!/2m lo dT] *y ei,TJ *> - (H.188) 

where J xy = p x p y /m = p x v y is a microscopic current, or flux, of x component of 
momentum in the y direction. If we denote 

•Mr) = /'%,, (11.189) 

the viscosity becomes 


r) = n 0 /3 


dT (JxyJxyirY) 


eq' 


(11.190) 


Equation (11.190) clearly expresses the viscosity in terms of a momentum 
current correlation function. We found in Chapter 10 that this is quite a general 
result. 


ll.G. COMPUTATION OF TRANSPORT COEFFICIENTS [6] 

The coefficients of self-diffusion, D, thermal conductivity, K, and shear 
viscosity, rj, may be written in the form 




m 2 \2 tt m 


K = n Q k B f P \ 
m 2 V 2m n 


3/2 f 

) I 


dpe 


-0p 2 /ln 


Px\, 


(11.191) 


) ' j dve-to 1 !** (11.192) 


and 


•»= (ii) 372 J (11-193) 


respectively, where we have used the ideal gas expression for the heat 
capacity per particle, c P = | k B . The functions D x , A x , and Bxy satisfy the 
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equations 


and 


Cp A X — pxi 



2m 



Cp &xy — PxPyi 


(11.194) 

(11.195) 


(11.196) 


These results are identical to the results obtained using the more traditional 
Chapman-Enskog procedure [6]. In order to obtain numerical values for the 
transport coefficients, we must expand the unknown functions A x , A x , and B xy 
in terms of a set of orthogonal functions, 5” (*), called Sonine polynomials, 
which are closely related to Leguerre polynomials. We describe these 
polynomials below. 


ll.G.l. Sonine Polynomials 

The Sonine polynomials form a complete set of orthogonal polynomials which 
may be used to obtain approximate expressions for the transport coefficients. 
The Sonine polynomials are defined as 


s*( x ) = v (-D 1 r (g + ” + 1 V 

U ] T( q + l+l)(n -/)!(/!)’ 


(11.197) 


where x and q are real numbers, n is an integer, and T(q + n + 1) is a gamma 
function. The two cases of the gamma function we shall be concerned about are 


J , 1\ 1 • 3 • 5 • • • (2n — 3) (2n — 1) \fir 

r { n + 2) = 2 ” ' 


(11.198) 


and T(n + 1) = n!. The Sonine polynomials for n = 0 and n = 1 are easily 
found to be 


s°W = i 


(11.199) 


and 


Sj(*) = q 4- 1 —x. (11.200) 

The Sonine polynomials have the orthogonality property 

J dxe~ x jfiS^(x)S^(x) = ^ + ^ 6 n>n >. (11.201) 
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Below we use the Sonine polynomials to obtain approximate expressions for the 
transport coefficients. We consider them one at a time below. 

11.G.2. Diffusion Coefficient 

In Eq. (11.191) the diffusion coefficient is expanded in terms of an unknown 
quantity, A x , Let us expand A* in terms of Sonine polynomials, S l 3 > 2 (f3p 2 /2m), 
so that 


00 


Ax — dlp x Sy 2 

1=0 



(11.202) 


We use these particular Sonine polynomials because in terms of them the 
diffusion coefficient becomes particularly simple. Let us substitute the 
expansion, (11.202), into Eq. (11.191) and perform the integration. Then 


D = - 





dp e 


-pp 2 /2mp2£l 


3/2 



m* 


S' 1 '! 


3/2 oc roc 




(11.203) 


J l r2n 

d cos (6) sin 2 ( 0 ) J d(f> 


COS 2 ( (f ) ) = 

m(3 


Thus, the self-diffusion coefficient depends only on the coefficient do. 

We now must find an approximate expression for do. Let us substitute Eq. 


(11.202) into Eq. (11.194), multiply the resulting expression by 
(/3/2nm) 3 / 2 e~P p2 / 2m S l y 2 {(3p 2 /2m)p x and integrate over p. Equation (11.194) 
then takes the form 


where 


oo 

Y. D/',/d/ = — 6p t o, (11.204) 

i=o P 


Di',i = 


0 


2nm 


3/2 


| dpe-^S'^ 



Px S 


l 

3/2 



(11.205) 


Equation (11.204) is actually an infinite set of equations which determines all 
the coefficients, di. 

The self-diffusion coefficient only depends on do. We can obtain an 
approximate expression for the self-diffusion coefficient if we use a truncated 
version of Eq. (1 1 .204). Let us retain only the first v + 1 terms in the sum and 
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write 

£ ^ 8, f> for ( = 0,l,...,v. (11.206) 

The set of coefficients D^\ form a {v + 1) x (u + 1) matrix. The coefficient 
is given by 

d M = «([ D M]- , )„ ()i (11.207) 

where ([D^] _1 ) 00 is the (0,0) matrix element of the inverse of matrix, D^K 
The coefficient of self-diffusion now becomes 


D = -^lim([I)M]-V (H- 208 ) 

V — >00 ’ 

When the matrix is truncated to lowest order, the self-diffusion coefficient is 
given by 


D = — 


1 1 

Jfi Ax) 


1 1 

P 2 (PxC+p x ) ' 


(11.209) 


To second order it is 


D = - 


1 J_ 

P 2 Dqo 


DoiDio 

DooDn 


(11.210) 


and so on. 

The self-diffusion coefficient is straightforward to compute for a hard sphere 
gas (see Exercise 11.4). For hard spheres of radius R it is 


1 1 3 1 f hT 
0 2 fcC+ft) _ 32no« 2 V rrm 


(11.211) 


In practice it is found that higher-order corrections only change Eq. (1 1.21 1) by 
a few percent (cf. Ref. 6, page 168). 


11.G.3. Thermal Conductivity 

In Eq. (11.192) the thermal conductivity is written in terms of an unknown 
quantity, A x . Let us expand A x in terms of Sonine polynomials, Sy 2 (ftp 2 /2m), 
so that 

( u - 212) 

If we substitute the expansion, Eq. (11.212), into Eq. (11.192) and perform the 
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integration, we find 


K = - 




5 

2 


p 2 x s 


3/2 



- + 


n 0 k B l (3 \ 


rrv- 


) $! L 00 r 00 / 

5>J # pU P e-^S\ /2 (^js‘ /2 

d cos (6) sin 2 ( 6 ) [ d<f> cos 2 {(f)) = . 

-l Jo 2mp 



(11.213) 

The thermal conductivity depends only on ai. 

Let us next determine a^ In Eq. (11.195) expand A x in Sonine polynomials, 
then multiply by {j3/2nmy 2 e~ l3pl ^ m /2m)p x and integrate over p. 

Equation (11.195) then takes the form 




(11.214) 


(11.215) 


In evaluating the integral on the righthand side of Eq. (11.214), we have used 
the orthogonality of the Sonine polynomials. Note that since the collision 
operator, C+ , conserves momentum we have Cp p x = 0. Therefore, M// ;0 = 0 
and Mo,/ = 0. Equation (11.214) becomes 


00 C 

for l’> 1. (11.216) 

/=! 1 " 


Using a similar approximation as in Eq. (11.209), we write the coefficient of 
thermal conductivity as 


25 nok B 1 _ 25 tiok B 1 


(11.217) 


In Exercise 1 1.4 we compute the thermal conductivity for a gas of hard spheres 
of radius R. We find 


75 Mr 
256 R 2 V mK 


(11.218) 
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TABLE 11.1. Thermal Conductivity, A, in Units of 10 3 W/ 
m.K. for Some Monatomic Gases at Pressure P = 10 s Pa. 
[14] 


Gas 

K(200K) 

K (300 K) 

AT (400AQ 

Argon (Ar) 

12.4 

17.9 

22.6 

Helium (He) 

119.3 

156.7 

190.6 

Krypton (Kr) 

6.4 

9.5 

12.3 

Neon (Ne) 

37.6 

49.8 

60.3 

Xenon (Xe) 

3.6 

5.5 

7.3 


Equation (11.218) is obtained by truncating Eq. (11.216) to lowest order. 
However, it is good to within a few percent (Ref. 6, page 168). Values of the 
thermal conductivity for some simple monatomic gases are given in Table 11.1. 


■ EXERCISE 11.4. The coefficient of thermal conductivity can be written 


K = 




2m 2 


PxA x , 


where A x = a ip x S l 3 j 2 ((3p 2 /2m). Compute the coefficient of thermal 

conductivity, K, for a gas of hard-sphere particles of radius R, keeping only 
the lowest order term (/ = 1 ) in the expansion for A x . 


Answer: First note that 


K= - 


= + 


npk B 

m 2 

n 0 k B 

m 2 




PxA x 



5 n 0 k B 
2 m(3 


ai, 


(1) 


so the thermal conductivity depends only on ai. Since the coefficient of 
thermal conductivity can be written 


K = - 


npks 







5' 

2 


Px, 


( 2 ) 
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we have that 


C+A x = 


2m 2 


If we multiply by (/3/27rm) 3//2 $ dp e Pp 1 / 1 ” 1 S* ( j3p 2 /2m)p x and keep only 
the lowest order term in the expansion of A x , Eq. (3) takes the form 

Mua i = -“, ( 4 ) 


where 


Mu — (Sl/lPx Cp S\/ 2 Px) 

= (2LTI (£)*• (5> 

We can use the fact that momentum is conserved during the collisions so 
C£p x = 0. Therefore Mu becomes 


M„ = 


( P 1 PxCtp 1 p x ) 


\ N (3 2 10 

= m 

4 V 4 


| dy 1 dy 2 e~^ +v ^ 2 ^ 


X | d(cos (0))^a^cr(©,g) 

x {V\>jcV 2 v + v 2 >v| - V\jv\ - V 2 jcv \) 2 ■ 

In order to evaluate the integral in Eq. (6), we introduce the center-of-mass 
velocity, V cw = \ (vi + v 2 ), and the relative velocity, g = v 2 — Vi, so that 
Vi = Vcm-jg, v 2 = V cm + ±g, v r = V cm -±g', and v 2 ;=V cm + ±g'. 
Then after some algebra we find 

vi',*v? + v 2 ', x vl - v M v? - v 2 , x v\ = g' x (g' • V cm ) - g x (g • V cm ) (7) 


A/T 1 N (5 2 10 

Mll = "4 vT m 


d\ cmdg, 6 


-Pm(V? m +g 2 / 4) 


I d( cos (&))dagcr(&,g) 


x Ife) 2 (g' • Vc») 2 - 2gU*(g' • V«) (g • Vcm) + «?( g • V_)‘ 



708 


TRANSPORT THEORY 


The collision cross section for hard spheres of radius R is 
<j(©, g) = (2R) 2 / 4 = R 2 . Let V cm ,i (i = x, y, z) denote the ith component 
of the center-of-mass velocity. Then 

| d\ m e-W™ VcmjVcmJ = — (—) 6 tj . (9) 


If we integrate over the center-of-mass velocity, we find 


M _ _i a p to (j_v_l fjL 

11 4 V 4 \2vm) 2m/3 [mD 


dgge 


-0mg 2 /4 


x d( cos (G))da[g ,2 g' 2 - 2g' x g x (g' • g) + g 2 x g 2 } 

- — ( JL.\ _L ( ' #2 dgge- pmg2 ^ 

3 4V 4 \27rm) 2mp\mp) 

x d{ cos {G))da [g 4 - (g' • g) 2 ]. 


In Eq. (10), we have let gl \ g 2 inside the integral, since by symmetry 
(S 3 / 2 PX S\/ 2 Px) = (S l 3 / 2 Py Cp S\/ 2 Py) = (S 3 / 2 PZ Cp Sy^Pz). Now choose 

the z-axis to lie along g. Then g' x = gsm (©) cos (a), g' y = g 
sin (©) sin (a), g' z = g cos (©) and g' • g = g 2 cos (©). First integrate over 
d(cos (©)) da. Then integrate over dg. We finally obtain 


64 n Q R 2 y/nm 

Ml1 " “T /J3/2 ’ 

and coefficient of thermal conductivity is given by 


256 R 2 V mn 


11.G.4. Shear Viscosity 

In Eq. (11.193), the shear viscosity is written in terms of an unknown quantity, 
Bxy. Let us expand Bxy in terms of Sonine polynomials, S l 5 / 2 (Pp 2 /2m), so that 

B v = g b ,p x p y S ‘ s/2 (@£) , (1 1.219) 

Let us substitute the expansion, Eq. (1 1.219), into Eq. (11.193) and perform the 
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integration. Then 


V = 


*-*■'>» !,(€) 


_nd3 /J\ g b r “ 6 d -0^1 f (1 1.220) 

m 2 V2tt mj ^ Jo ' \2m ) 

J 1 rl'K 

d cos (6) sin 4 (6) J dcf> sin 2 {<}>) cos 2 (<f>) = — 


nobo 

~T 


Thus, the shear viscosity depends only on bo- 

In order to determine b 0 , substitute the expansion Eq. (11.219) for Bxy into 
Eq. (11.196). Then multiply by e~ /3p2 / 2m S l y 2 ((3p 2 /2m) p x p y and 

integrate over p. Equation (11.196) then takes the form 


N *v bl= ^2 6l ’^ 


1=0 


( 11 . 221 ) 


where 


n "=G LT\ d ^ / 2 ms ^(S) p ^ s 


/ iPp 2 

5 / 2 V 2 m 


PxPy 

(11.222) 

If we truncate Eq. (11.221) to lowest order, we can write the shear viscosity as 
now 2 1 mom 2 1 


r) = 


(11.223) 


(3 3 Noo (3 3 (p x pyC+p x p y ) 

For a gas of hard spheres of radius R, Eq. (1 1.223) yields for the shear viscosity 

5 ImksT 


1 — 


64 R 2 


7 r 


(11.224) 


TABLE 11.2. Viscosity, tj , in Units of 10 s poise for Some 
Monatomic gases at Pressure P = 10 s Pa. [14] 


Gas 

r] (200 K) 

7 ] (300/0 

7 ] (400/0 

Argon (Ar) 

15.9 

22.9 

38.8 

Helium (He) 

15.3 

20.0 

24.4 

Krypton (Kr) 

17.1 

25.6 

33.1 

Neon (Ne) 

24.3 

32.1 

38.9 

Xenon (Xe) 

15.4 

23.2 

30.7 
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This expression is correct to within a few percent (cf. Ref. 6, page 168). Values 
of the viscosity for some monatomic gases are given in Table 11.2. 

It is interesting to compare the viscosity and thermal conductivity. If we take 
the ratio we find K/rj= 15kjs/4m = | cy, where c v = 3kg /2m is the specific 
heat of an ideal monatomic gas. The ratio K/r)c v = | is called the Euken 
number. One can check to see if this relation is satisfied by the monatomic gases 
in Tables 11.1 and 11.2. 


^ SPECIAL TOPICS 

► S1 1 .A. Beyond the Boltzmann Equation 

Computer experiments and a careful study of correlation functions have shown 
that the picture of transport phenomena given by Boltzmann is not completely 
correct. It is now realized that many-body processes play a very important role 
in determining of the long-time behavior of the correlation functions, and, at 
best, Boltzmann’s equation can only describe short-time processes. The 
Boltzmann equation predicts exponential decay of the correlation functions, 
but simple hydrodynamic arguments give a decay of the form, t ~ d / 2 (long-time 
tail), where d is the dimension of the system. These long-time tails have been 
observed in computer experiments and are now well-established theoretically. 
Long-time tails in the velocity autocorrelations function of a Brownian particle 
were derived in Section (S10.G). 

The many-body processes which give rise to the long-time tails have forced a 
change in our analysis of linear hydrodynamics. It was long thought that 
corrections to the Navier-Stokes equations could be obtained by expanding the 
macroscopic currents in terms of higher-order gradients, thereby introducing 
new transport coefficients to account for shorter-wavelength effects. However, 
this is now known to be incorrect. Microscopic calculations have shown that the 
transport coefficients in the so-called Burnett order (coefficients of the fourth- 
order gradients) and higher order are infinite. Therefore, a simple power series 
expansion of the hydrodynamic frequencies in terms of the wavevector will not 
work. The hydrodynamic frequencies are nonanalytic functions of the wave 
vectors. 

Transport coefficients can be computed by integrating the appropriate time 
dependent current correlation functions over an infinite time interval, from 
t = 0 to t = oo [cf. Eq. (11.190)]. Therefore, if the transport coefficients are to 
be finite, the correlation functions must decay sufficiently rapidly in time. For a 
long time it was a common view that the correlation functions would decay 
exponentially since this is what is predicted by the Boltzmann equation. 
However, in 1967, Alder and Wainwright [15], in a computer calculation of the 
velocity autocorrelation function for a gas of hard spheres, found that the decay 
of the autocorrelation function was exponential only for a few mean free times 
and that after a longer time the decay went as t ~ 3 / 2 (in three dimensions). In 
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Fig. 11.10. A plot of the difference between the velocity autocorrelation function 
obtained from molecular dynamics calculations for hard spheres and that given by 
Boltzmann theory. The correlation function is plotted as a function of mean collision 
times for 108 particles at V /Vo = 1.6, 3, and 20 (based on Ref. [15]). 


other words, it had a long-time tail. In Fig. 11.10 we give their results for 
various densities. The figure gives the difference between the exact velocity 
autocorrelation function for a hard-sphere gas and that computed using the 
Boltzmann equation. Both are plotted as a function of the number of collisions 
that have occurred. The different curves correspond to different particle densities 
(V 0 is the close-packed volume). We see that after many collisions have occur- 
red, the correlation functions decay away very slowly, as r 3 / 2 . The long-time 
tails are now known to be due to the long-lived hydrodynamic modes of the fluid. 
This can be shown using hydrodynamic arguments [16] (see Section S10.G). 

Expressions for transport coefficients obtained using the Boltzmann equation 
are limited to systems with very low density. If we wish to obtain expressions 
for the transport coefficients at higher densities using microscopic theory, we 
must begin by writing them in terms of a virial expansion (expansion in powers 
of the density). The first attempt in this direction was made by Bogoliubov [17], 
who gave a prescription for obtaining a virial expansion for the transport 
coefficients, but the first actual calculation was done by Choh and Uhlenbeck 
[18]. The Boltzmann equation includes only effects of two-body collisions. 
Choh and Uhlenbeck computed the contribution to the transport coefficients due 
to three-body processes. Their result was well-behaved. Somewhat later, a 
number of authors attempted to extend calculations of the transport coefficients 
to higher order in density. In so doing, they found that all higher-order terms are 
infinite, and therefore the expansion of the transport coefficients in powers of 
the density is divergent and ill-defined. The divergences in the virial expansion 
come from secular effects. Resummation of divergent terms [19] leads to well- 
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defined expressions for the transport coefficients but introduces a nonanalytic 
density expansion for them. The divergences are related to the appearance of 
long-time tails in the correlation functions. One of the simplest correlation 
functions we can compute, and the one measured in the molecular dynamics 
calculations of Alder and Wain wright, is the velocity autocorrelation function. 
We will discuss briefly how a virial expansion for this quantity is introduced. 

If we consider a classical system of N particles which interact via a 
spherically symmetric potential, the Hamiltonian has the form 

H A '(X") = f;^ + X;v(|q j - % |), (11.225) 

i=l i<j 


and the Liouville operator has the form 

I N (x N ) = V ^ d V dVij (— 

{ } m dq t Zjdq/ydp, dpj)' 


(11.226) 


If we tag particle 1, the probability density of finding particle 1 at point r at time 
t is 


P . (r, t) = <«5(q, « - r)> = J- • • J dX, • • • dX N p e ,(\ N ) e lL “ 6( q, - r), 

(11.227) 

where q t is the position of particle 1 at time t = 0, and ^(X^) is the 
equilibrium probability density 


p eq (X N ) = 


,-m N 


(11.228) 


(11.229) 


/••• f dXi-'-dXNe-P 1 "' 

The velocity autocorrelation function is given by 

Cw = (v(0,0)v(r, t)) = (PiSfqJPifrWq x (t) - r)) 

= [ • • • f dX, - dX N p e ,(\ N ) Pl «(q, ) e ,L " Pl tf(q, - r). 

It is most convenient to work with the Fourier-Laplace transform 
5w(k, z) = | dte~ zt j dre ,k r C vv (r, t). 

If we expand in the time evolution operator, e tL , in a power series, integrate 


(11.230) 
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each term by parts, and resum it, we obtain an expression in which the time 
evolution operator acts on the phase function to the left. We then find, after 
some simplification, 


S,v(M - m2 v 


dte 


a E| 

k, J 


X Pl^ 


,kq > e' tL " p eq (: X N )e 


d X] • • • d Xjy 
Pi, 


(11.231) 


where the minus sign on the evolution operator comes from the integration by 
parts. 

Let us now rewrite S vv (k,z) in the form 


Syy (k, Z ) ? 

m z 


dViPiX(PuKz)Pu 


(11.232) 


where 


x(Pi;k,z) = 


1 


dte ~ zt Y j d( li J dX.-.^dX, 

xe ik *e- tL " p eq (X N )e~ ik ^. 

We can write the operator, x(Pi;k,z), as a virial expansion: 

oo 

x(Pi;k,z) = Y nl xi(P\-,Kz). 

1=0 


(11.233) 


(11.234) 


To find a microscopic expression for the virial coefficients, x/(Pi ; k, z), we first 
introduce a cumulant expansion for the evolution operator, 


e~‘ L “ = S,( 1 „ N) = exp ^ U t (l , . . . , 1) V (1 1 .235) 

where U t ( 1, . . . , /) is an l - body cluster operator. The cluster operators can be 
expressed in terms of the evolution operator through the following hierarchy of 
equations: 


u,(l) =5,(1), (11.236) 

t/,(l,2)=S,(l,2)-S,(l)S,(2), (11.237) 

( 3 ) 3 

£/,(l,2, 3) = 5,(1, 2, 3) - S, (1)5, (2, 3) + 2! S,(0. 

P 1=1 


(11.238) 
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and so on, where denotes the sum over all j distinct permutations of the 
particles. If we isolate those cluster operators which depend on the coordinates 
of particle 1, we can write the evolution operator in terms of the following 
hierarchy: 

S,(l) = f/,(l), (11.239) 

S,(l,2) = U t (l)S t (2) + If, (1,2), (11.240) 


( 2 )' 

S,(l,2,3) = J/,(l)S,(2,3) + £ I/ r (l,2)S,(3) + £/,(l,2,3), (11.241) 

P 

and so on. The prime the summation, > means that particle 1 is not to be 
included in the j distinct permutations. 

We now substitute Eqs. (1 1 .239)— (1 1.241) for the case of N particles into Eq. 
(11.233) and we obtain 




ki 


dte 


-zt 


d<{\ 


dX 2 


dX f 


{N- 1)' 


|l/ f (l)S f (2,...,A0 + £ U t (l,2)S t (3,...,N) 

(N-l)(N-2)/2' \ 

+ £ 3 ) 5 >( 4 > • • ■ - N ) + ■ • T . 

(11.242) 

Equation (11.242) simplifies somewhat if we note that 



dX 2 ---dX N S,(2,...,N)p eq (X N ) = i (^) ' e-W lm . 

(11.243) 


Let us now use Eq. (1 1.243) and take the thermodynamic limit in Eq. (1 1.242). 
That is, we let N — > oo and V — * oo so n = N/V is finite. Also, (1/V) ]Ck, "" * 
[1/(27 t) 3 ] J dk\. We can then write Eq. (11.242) in the form 


* 

r oo 

r 1 

r k, j 

o 

1 

dqA 


i c°° 

X(Pi;k,z)=— j dki dte~ a </qJe ik - , "C/,(l)e- i, “'’'0(l) 
(27t) J Jo J I 


+ n | d X 2 e ik q ' £/ f ( 1 , 2)e~ iki Ql </>( 1 )</>(2) 

+ ^ J dX 2 J dX 3 e** U t { 1,2, 3)e~^ 0(1)0(2)0(3) + • • * 


(11.244) 
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Let us next note that 


(2tt) 


y 


f dki 

* OC f 

dte~ zt 

J J 

0 


Jq 1 e ,kq > f/,(l)<r ,ltiq > = 


(11.245) 


z - ik • p j/m 

Then the equation for x(Pi;k, z) takes the form of a virial expansion, 

1 00 

x(Pi ; k, z)^ -1 ( i ) = ^ — ^ ^x/(Pi ; K z)4>~ 1 ( l ) , (n.246) 


with virial coefficients, 


Xi(Pi;k,z) = 


d ki 


dte 


-zt 


d<h 


(27T) 3 

xe^U t { l,2)^- ,ki q > 0(1)0(2), 


</X 2 


(11.247) 


X2(Pi;k,z)=i 1 


2! (27t) : 


dk, 




Jqi 


</X 2 


JX, 


x e ikq ‘ t/,(l, 2, 3)e~ ,krqi 0(1)0(2)0(3), (11.248) 

and so on. 

As a final step we can use a resummation procedure introduced by Zwanzig 
[20] and write x(Pi',k,z) in the form 


x(Pi;k,z)<£ (1) = 


1 


z - ik • p i/m + S/=i « /fi /(Pi ; k, z) ’ 


(11.249) 


where 


m 


fi i(Pi;k,z) = z — — k • pj xi(Pi;k,z) ( z k • p, <j> *(1), (11.250) 


m 


fi 2 (Pi;k,z)= (z-^k-p 1 Jx 2 (Pi;k,z) (z - ^ k • pj <f> *(1) 
z- — k-Pj) Xi(Pi;k,z) (z-- k-p, 


x Xi (Pi ; k, z) ( z — — k • pj ) (f> *(1), 


(11.251) 


and so on. Note that p L and x(Pi; k,z) do not commute. We can now combine 
Eqs. (1 1.232) and (1 1.249) and write the velocity autocorrelation function in the 
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form 


'( Kz )=- 2 

m l 


JpiPi 


z - *k • Pi /m + E /=1 nl Bi{ Pi ; k, z) 


Pi 0(1). 
(11.252) 


The long-time hydrodynamic behavior of the correlation function is determined 
by the behaviour of ^/(pj ; k, z) in the limit z — * 0 and k — > 0. Microscopic 
expressions for 5;( pj ; k, z) have been obtained by several authors using a binary 
collision expansion. We will not attempt to repeat their results but only describe 
the essential features. 

The first-order term, (p x ; k, z), is well-behaved in the limit z — > 0 and 
k — > 0 and, in fact, reduces to the Lorentz-Boltzmann collision operator. 
Higher-order contributions were studied by Zwanzig [21] and Kawasaki and 
Oppenheim [19]. Zwanzig found that in three dimensions, fi 2 (p, ; k, z) is well- 
defined as z — » 0. Kawasaki and Oppenheim found that fi 2 (Pi;k,z) is well- 
defined for three dimensions and diverges as In (z) for two dimensions: 



well-behaved 

In (z) 


(3 dimensions) 
(2 dimensions).. 


(11.253) 


For all higher powers they found that the most divergent terms behave as 


*/(Pi,z) ~ 



(3 dimensions) 


(2 dimensions). 


(11.254) 


Kawasaki and Oppenheim showed that these most divergent terms in each order 
of the density expansion result from repeated collisions in the binary collision 
expansion of 5/(Pi,z). These repeated collisions can last a very long time and 
therefore lead to secular terms in the correlation function which diverge in the 
limit t — * oo. 

The relation between the divergences in the viral expansion of the transport 
coefficients and the long-time tails found by Alder and Wainwright [15] has 
been shown by Dorfmann and Cohen [21] and by Dufty [22]. The Boltzmann 
equation is unable to account for the long-time tails observed in the correlation 
functions. They, in fact, result from long-wavelength hydrodynamic effects in 
the system. When the most divergent terms in the virial expansion of the 
transport coefficients are summed together, they give contributions to the 
current-current correlation functions which decay as t ~ d ! 2 (where d is the 
number of dimensions) even for low density. To obtain this result it is necessary 
to retain contributions from collision processes involving all possible numbers 
of particles, even for low density. In practice, at very low density the many- 
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body effects give only a very small contribution to the transport coefficients and 
the Boltzmann equation is adequate. But in principle they are there and become 
more important as the density is raised. 

The fact that we must include many-body effects in our expressions for the 
time correlation function has important consequences for hydrodynamics. Until 
now we have only discussed the Navier-Stokes limit of the hydrodynamic 
equations. That is, we have kept terms to order k 2 in the linearized 
hydrodynamic equations. However, it is also of interest to go to higher orders 
in k and by so doing introduce new transport coefficients. For example, we may 
wish to write the diffusion equation in the form 

r\ 

= Dk 2 n + D 2 k 4 n + • • • . (11 .255) 

Corrections of this type were first introduced by Burnett [23] for the linearized 
hydrodynamic equations. By including contributions to the correlation function 
from higher powers in the density, Dufty and McLennan [24] have shown for 
diffusion processes that the transport coefficient, D 2 , does not exist (is infinite) 
and that simple corrections of the linearized hydrodynamic equations involving 
higher-order gradients are not possible. Similar results have been obtained by 
Ernst and Dorfmann [25] for the corrections to the Navier-Stokes equations. 
Thus, and ^-dependent corrections to the Navier-Stokes equations appear to be 
nonanalytic in k, and simple corrections in terms of higher-order gradients are 
not possible. 
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PROBLEMS 

Problem 11.1. A dilute gas of density n is contained in a cubic box and is in equilibrium 
with the walls at temperature T. Find the number of particles per unit area per unit time 
which collide with the walls and have magnitude of velocity greater than vo. 

Problem 11.2. Estimate the value of the coefficient of viscosity of argon gas at 25 °C 
and 1 atm pressure. Compare your estimate with the experimentally observed value of 
rj = 2.27 x 10“ 4 g • cm -1 - sec -1 . Argon has an atomic weight of 39.9 and at low 
temperature forms a closely packed solid with density p — 1.65 g/cm 3 . 

Problem 11.3. The number density in phase space of an ideal gas can be written 

/ (p , r) =£ (5^) + £ sin («) cos (»)). 

where 6 and 0 are the polar and azimuthal angles, respectively, of the momentum, p 
measured with respect to the z axis. 

(a) Compute the net number of particles, per unit area per unit time, that pass through 
the y-z plane. 

(b) Compute the net number of particles, per unit area per unit time, that pass 
through the x-z plane. 

Problem 11.4. The electrons in a cubic sample of metallic solid of volume L 3 and 
temperature T may be considered to be a highly degenerate ideal Fermi-Dirac gas. 
Assume the surface of the cube forms a potential energy barrier which is infinitely high. 
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and assume that the electrons have spin s = f, mass m, and charge e. (a) What is the 
number of electrons in the velocity interval, v — > v + dvl (b) Assume that an electrode 
is attached to the surface which lowers the potential energy to a large but finite value, W, 
in a small area, A, of the surface. Those electrons which have enough kinetic energy to 
overcome the barrier can escape the solid. How many electrons per second escape 
through the area, A? Assume that W » 0 and use this fact to make simplifying 
approximations to any integrals you might need to do. 

Problem 11.5. Find the differential scattering cross section, in the lab frame, for two 
identical hard spheres of diameter D and mass m. 

Problem 11.6. (a) Prove that the linearized Boltzmann and Lorentz-Boltzmann 
collision operators, C+ and C~, respectively, are self-adjoint, (b) Prove that the scalar 
products, ($, C + <f>) and (<h, C“$), are always less than or equal to zero for arbitrary 
functions, $ = $(p). 

Problem 11.7. An approximate expression for the Boltzmann collision operator, C, is 

5 

C= -7I +7 ^ \<t>i 

i= 1 

where |0,-) are the five orthonormal eigenfunctions of C with eigenvalue equal to zero. 
The five eigenfunctions are |0i) = |1), |0 2 ) = \ p x ), \<j> 3 ) = \ p y ), |0 4 > = \p z ), and 
|05) = |f - (/3p 2 /2m)). They are orthonormal with respect to the scalar product, 

<0(P)|X(P)> = ) | dpe~P p2 / 2m 0(p)x(p)- 


(a) Compute the coefficient of shear viscosity 




and the coefficient thermal conductivity, 
nk B 


K — — lim 

£— *0 m z 




C -j~ £ 


Px 


2m 


where n is the number density of particles in the gas described by C and m is the 
mass of the particles. 

(b) What are the units of 7 ? What are the units of 7 ? 

Problem 11.8. The coefficient of diffusion can written in the form 


D = - lim — 5 
e— *0 m l 



1 


(C<">+£) 


Px, 


where Cp -) is the Lorentz-Boltzmann collision operator. Replace by an effective 
collision operator, 7p~\ where Jp~ ] acting on an arbitrary function, h(p,r, t), has the 
form 


•/£ ^(P> r > 0 = -iHp, r, t) + 7 
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(a) Show that is self-adjoint and negative semidefinite and has one eigenfunction 
with eigenvalue zero. 

(b) Use 7p to obtain an explicit expression for the diffusion coefficient (do all the 
integrals). 

(c) What are the units of 7 ? 

Problem 11.9. Write the linearized Boltzmann equation for a dilute gas of interacting 
particles. 

(a) Starting from this linearized Boltzmann equation, derive five linearized 
hydrodynamic equations. 

(b) From the information in part (a), write a microscopic expression for the 
momentum current. 

Problem 11.10. The coefficient of self-duffusion can be written 

D =4(^n^ /2> ' A '’ 

where A* = YlHo dip x SL 2 [ftp 1 J2.m). Compute the coefficient of self-diffusion, D, for a 
gas of hard-sphere particles of radius R, keeping only the lowest-order term (/ = 0) in 
the expansion for A x . 

Problem 11.11. The coefficient of shear viscosity can be written 

L)' l dfe '"' 1,2mp - p ^’ 

where B ^ b/ PxPy S l 5 / 2 {ftp 2 /2m). Compute the coefficient of shear viscosity, //, 

for a gas of hard-sphere particles of radius R, keeping only the lowest-order term (/ = 0) 
in the expansion for B^. 
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NONEQUILIBRIUM PHASE 
TRANSITIONS 


12.A. INTRODUCTION 

In previous chapters we considered phase transitions that occur in systems in 
thermodynamic equilibrium. In this chapter we will give examples of phase 
transitions which can occur in chemical and hydrodynamic systems far from 
absolute thermodynamic equilibrium. The systems we will consider are locally 
in thermodynamic equilibrium, so the state of the system can be described in 
terms of thermodynamic densities which vary in space. However, they are far 
enough from absolute equilibrium that nonlinear effects must be included in the 
chemical rate equations or hydrodynamic equations. As we shall see, nonlinear 
effects open a whole new world of behavior for such systems. Nonlinear 
equations allow the possibility of multiple solutions, each with different regions 
of stability. Thus, as we change the parameters of a nonlinear system, it can 
exhibit phase transitions from one state to another. 

Prigogine was first to show that near equilibrium (in the linear regime), if 
one of the thermodynamic forces is held fixed, the stable state of the system 
is a steady state characterized by a minimum entropy production and it is 
unique. This state is said to lie on the thermodynamic branch. However, as we 
move away from the linear regime, nonlinearities in the hydrodynamic 
equations or chemical rate equations become more important and at some point 
the thermodynamic branch becomes unstable and a nonequilibrium phase 
transition occurs. The system then changes to a new state which is characterized 
by an order parameter. Often the change is dramatic. Even if the boundary is 
held fixed and the steady state in the linear regime is homogeneous or has 
constant gradients, the new state that appears in the nonlinear regime can 
oscillate in space and/or in time and often exhibits nonlinear wave motion. 
Thus, the symmetry of such systems in space and/or in time is broken at the 
phase transition, and the new stable nonequilibrium state exhibits much more 
structure than the state on the thermodynamic branch or the state of thermo- 
dynamic equilibrium. These structured states (called dissipative structures by 
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Prigogine) require a flow of energy and sometimes a flow of matter and, there- 
fore, a production of entropy to maintain them. 

In this chapter we will illustrate nonequilibrium phase transitions with some 
classic examples taken from hydrodynamics and chemistry. Before we begin to 
consider examples, we will discuss how equilibrium stability arguments 
determine the stability of systems far from equilibrium. We will then consider 
two important nonlinear chemical reactions. In the first, the Schlogl model, we 
show that chemical systems can exhibit nonequilibrium phase transitions 
similar to the first-order transitions in the van der Waals system. That is, abrupt 
transitions in the concentration of one of the chemicals can occur. As a second 
example, we discuss the Brusselator chemical model introduced by Prigogine 
and Lefever. This is the simplest chemical model that can exhibit spatial and 
temporal dissipative structures — that is, oscillations in space and in time 
maintained by the flow of chemicals through the system. (It is interesting to 
note that many of the ideas of nonlinear chemistry can also be applied to 
population dynamics and ecological systems because, much like chemical 
systems, individuals continually interact and are changed by their interactions, 
thus causing dynamical evolution of the populations. Unfortunately, we do not 
have space to explore this fascinating subject here.) 

As a final example, we consider a purely hydrodynamic system and show 
that the nonlinearities in the Navier-Stokes equations can lead to instabilities in 
fluid flow. Turbulence is one example of such an instability. The example we 
shall consider is called the Rayleigh-Benard instability, and it occurs in a fluid 
layer in the presence of gravity when the layer is heated from below. When the 
temperature gradient becomes great enough, the fluid undergoes a transition 
from an isotropic stationary state to a state whose macroscopic behavior is 
dominated by stationary spatially periodic convection cells. The hydrodynamic 
equations can predict the parameter values at which this instability occurs. 

In the special topics section, we study the behavior of fluctuations in the 
neighbourhood of some instabilities in the Brusselator and Rayleigh-Benard 
systems. Near the critical point, but in the isotropic phase, it is possible to 
isolate an order parameter for the nonequilibrium phase transition. The order 
parameter corresponds to a mode in the nonequilibrium system which exhibits 
critical slowing down. One can then use this ‘critical’ mode to write a time- 
dependent Ginzburg-Landau equation which describes the dominant behavior 
close to the critical point. In this way one obtains a description of non- 
equilibrium phase transitions which has some analogies to the description of 
equilibrium phase transitions. 


12.B. NONEQUILIBRIUM STABILITY CRITERIA [1-3] 

Nonequilibrium systems are dissipative because of transport processes and 
chemical reactions which lead to an increase in their entropy. Prigogine [11 
showed that there are stability conditions associated with the rate at which 
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entropy increases in a nonequilibrium system. In this section we will discuss 
these stability criteria, first for systems near equilibrium and then for systems 
far from equilibrium. 

Let us consider the entropy balance equation for a closed isolated system of 
constant volume. For such a system no heat or matter can be transported 
through the walls. We can integrate the entropy balance equation [cf. Eqs. 
(10.10), (10.24), and (10.179)] over the volume, V, of the system and find the 
following equation for the total time rate of change of the entropy: 

r r (**'■*) 

= - j (ps\ + Jf ) • dS + j <7 s dV, 

where J 5 denotes integration over the surface of volume, V. Since no energy or 
matter can flow through the surface, the contribution from the surface integral is 
zero. Thus 


dS 

dt 


* 

a s dV 
v 


(12.2) 


is the rate of entropy production due to spontaneous processes in the system. 
The second law of thermodynamics tells us that for reversible processes the 
entropy of the system does not change, but for irreversible or spontaneous 
processes it must increase. Therefore, 


dS 

dt 


ex s dV > 0 
v 


(12.3) 


and the local entropy production, a s (time rate of change of entropy per unit 
volume), must be positive. The local entropy production can be written in the 
form 


^ = (12.4) 

I 

where r / i is a local current, or flux, and Xj is a local force [cf. Eqs. (10.26) and 
(10.179). We will now examine the behavior of this quantity both near 
equilibrium and far from equilibrium. 


12.B.1. Stability Conditions Near Equilibrium 

As we have seen in Chapter 10, near equilibrium the currents, J [ i , may be 
approximated by a linear relation of the form 

c/i = ~ LijXj, 


(12.5) 
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where Ly is a symmetric matrix, Ly = (Onsager’s relation). If we substitute 
Eq. (12.5) into Eq. (12.4), we find in the linear regime 

** = ££ LijXiXj — 0 ( 12 . 6 ) 


[cf. Eq. (12.4)]. Therefore, for a system in which spontaneous processes take 
place, the phenomenological coefficients form a symmetric positive definite 
matrix (cf. Section 2.H). 

We can now ask under what conditions the entropy production will be 
minimum. For simplicity, let us first consider the case with two currents and two 
forces: 


2 2 

°'»=5I£ z *XiX/ >0 > < 12 - 7 ) 

i=i «=i 

where we have taken the forces to be independent variables. The fluxes are then 
dependent. If we allow the forces to vary freely, the condition for minimum 
entropy production is given by the equations 

(|£) = 2L„xi + Ihnxi = -2/, = 0 (12.8) 

and 

= Ihnxi + 2L2ixi s -2/ 2 = 0 (12.9) 

Xi 

(Note that we have assumed that Ly is constant.) When there are no constraints 
on the forces, the state of minimum entropy production is the equilibrium state. 
The entropy production will be minimum (zero) when all forces and fluxes 
vanish. 

Let us now hold one force fixed. That is, we let xi = constant (this could 
be the case of a system with a constant temperature gradient maintained 
across it). Then the condition for minimum entropy production becomes 
simply 


( d(T s 

\dX2 


= llmXi + 2L2,xi = —2^2 — 0 (12.10) 

Xi 

If we hold xi — constant, then will be zero in a state of minimum entropy 
production, but ^ x will be nonzero and constant. Thus, the state of minimum 
entropy production will be a stationary state. This fact was first established by 
Prigogine [1]. 
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We can generalize this result to an arbitrary number of forces and currents. 
Let us consider a system with n independent forces x;(i = 1, •••,«)• The 
entropy production is 




n 


LijXiXj- 

i,j= 1 


( 12 . 11 ) 


Let us now hold the forces x»( i=l,...,fc) constant. The condition for 
minimum entropy production becomes 


dcrA = 

@Xi) 


n 


(Ljj + Lji)xj — 0 

j= i 


(12.12) 


(i = jt + 1, . . . , n). Using Onsager’s relations, Eq. (12.5) becomes 

2/ ( = 0 (12.13) 


for (i = k -f 1 , . . . , n). In other words, if we hold x/(* = 1 , . . . , k) fixed, then 
< / i ■ = 0(i = k + 1, . . . , n) when the system is in a state of minimum entropy 
production, and the currents fi(i= 1, . . . , k) will be constant. Such a state is a 
Mi-order stationary state. 

In the linear regime, the stationary states are always stable. That is, 
fluctuations cannot drive a system away from the stationary state. (This is no 
longer true in the nonlinear regime.) Let us consider a system specified by n 
independent forces. We will hold forces xi , • • • , Xk fixed and allow fluctuations 
to occur in forces Xk+i , • • • , Xn- Thus, we consider the entropy production to be 
a function of the forces Xk+i , . . . , x«- We will let x*+i , • • • , Xn denote the values 
of the forces x*+i, • . . , x« in the steady state. The entropy production near the 
steady state can be expanded in a Taylor series of the form 


cr, 


n / q \ 0 

(Xk+ 1 , . . . , Xn) = Vs{x°k+ 1 > • • • ’ x2) + S 

i=k+l \ a XiJ {xj^xi) 


!AAf d fda s \° 
+ 2^ dXi W/i 


i o 


[d Xi KdXjJ^xj } J 


WXi + 




= Cr J (x? +1 ,.--,Xn)+ j EE WX; + -. (12.14) 


«=1 7=1 


The quantity 



>0 


(12.15) 
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is called the excess entropy production and is always positive since the 
coefficients Ly form a positive definite symmetric matrix. Since the stationary 
state is a state of minimum entropy production, we also have 

dv Y \ dvy gySfiSfj < 0, (12.16) 

V y.j+1 dt J V y =4+1 

where we have used the fact that <5/ = L -1 • M • 6<x — g • Set and 6} = d6ct/dt 
(cf. Section 10.D). Thus, the excess entropy function, A P, is a Liapounov 
function (cf. Appendix C). 

Equations (12.15) and (12.16) guarantee the stability of stationary states in 
the linear regime. In the nonlinear regime, Eqs. (12.15) and (12.16) need not be 
satisfied and we have the possibility of forming unstable stationary states as 
various parameters are changed and of having phase transitions far from 
equilibrium. 

12.B.2. Stability Conditions Far From Equilibrium 

As we have seen above, it is possible to create a nonequilibrium steady state by 
holding some thermodynamic forces, such as the affinity or gradients of various 
densities, fixed. In the linear regime, the steady state which results is a state of 
minimum entropy production and is always stable. In the nonlinear regime, we 
can no longer write the currents as linear functions of the forces and it is no 
longer possible to prove that the steady state will continue to be stable in the 
nonlinear regime. Thus, we have the possibility of creating nonequilibrium 
steady states which become unstable as we vary certain parameters of the 
system, and the system can evolve to a new state which does not exist in the 
linear regime. That is, it can undergo a nonequilibrium phase transition. 

12.B.2.1. General Arguments 

A thermodynamic stability theory for systems far from equilibrium has been 
developed by Glansdorff and Prigogine [2]. When we consider systems for 
which the linear approximation is no longer valid, we can no longer show that 
the time rate of change of the entropy production has a definite sign, but we can 
show that part of it has a definite sign. If we write the entropy production in the 
form 



P 


dvy/tXi, 

i 


(12.17) 


its time derivative can be written 
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In the linear regime we know that 


I I 


(12.19) 


/n nonlinear regime for systems in local equilibrium, it is possible to prove 
that the relation 


is always true because of thermodynamic stability relations which hold in each 
small region of the system, but it is not in general possible to say anything about 
the sign of dPj/dt, where 


djP 

dt 



( 12 . 21 ) 


However, under certain conditions, if we can establish one additional property 
for the system, Equation (12.20) enables us to say something about the stability 
of a nonequilibrium steady state. 

In order to illustrate the use of thermodynamic stability theory for 
nonequilibrium steady states, we shall consider the case of a chemically 
reacting system of N types of molecules which is held far from equilibrium (it 
has large affinity). We shall assume that the effects of diffusion, viscosity, and 
thermal conductivity can be neglected and that the pressure and temperature are 
uniform throughout the system. The entropy production for such a system can 
be written 


P 


1 

T 


dV^cAc 

a= 1 


(12.22) 


[cf. Eq. (10.179)], where r is the number of reactions. Therefore, 


d*P 

dt 




(12.23) 


From Eq. (10.179), we can write the balance equation for the jth molecular 
component in the form 


dcj 

dt 



a— l 


(12.24) 
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If we now use the definition of the affinity, A a = J2jL\ VjaHj, of the alh 
reaction, we find 


dt T 4-f dt dt' 

J 7=1 


(12.25) 


Let us next note that since pressure and temperature are held constant 


dt 



(12.26) 


and Eq. (12.25) becomes 


dxP 

dt 


1 

T 


N N 


i= 1 7=1 


\d c i) PJ ^ Ci} dt dt ~ 


(12.27) 


To establish the inequality we have used the stability condition in Eq. (2.179). 
Since we assume that the system is in local equilibrium, equilibrium stability 
conditions ensure that d x P/dt always has a well-defined sign. 

12.B.2.2. Stability of Nonlinear Chemical Reactions 

Thermodynamic stability theory for systems undergoing nonlinear chemical 
reactions is of considerable interest because of the importance of such reactions 
in biological systems. The approximation in which the rates of reaction, J c a , are 
driven by terms linear in the affinities is true only near equilibrium. If we wish 
to examine the behavior of chemically reacting systems far from equilibrium, 
we must first find the exact dependence of reaction rates on the affinities. 
This can be done as follows. Let us consider a general chemical reaction of 
the form 


—V\A\ — i y 2 A 2 ^ zVh + zVU. 

*2 


The rate equation can then be written 


dc i - t.-hUnl 
di ” 11 Cl 


- k 2 cf 


= hc\ 


^il J^l 


1 - 


fee !* 1 


>4f 


k\c[ 


hL 


(12.28) 


(12.29) 


where c ( = «,/V is the molar density. The rate of reaction, J c , is determined 
from J c = (l/v){8ci/dt) [cf. Eq. (12.24)]. For system of ideal gases, we can 
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write the affinity in the form 

(12.30) 

[cf. Eq. (2.244)], where K(T,P) is a function of temperature and pressure only. 
At equilibrium, A = 0 and dcj/dt = 0. If we compare Eqs. (12.29) and (12.30), 
we see that K(T,P) = and 

~ = v 3 J c = kiC^c^ll - e A/RT ], (12.31) 

Thus, Eq. (12.31) relates the reaction rate, J c , to the affinity, A. In general, J c is 
a nonlinear function of A. For small A, J c becomes linear in A. 

Let us now look at the stability problem for a case in which only chemical 
reactions take place. To determine the stability of a given steady state we can 
expand the local entropy around that steady state. The result, schematically, is 
of the form 


A=RT\n K(T,P) 


c 3 c 4 


,hLM 

'1 C 2 j 


Slocal ~ Sfocal + $ $ local + & S local + ' " • (12.32) 

The various terms in Eq. (12.32) are identical inform to those obtained for the 
Taylor expansion about absolute equilibrium. However, Eq. (12.32) differs from 
the equilibrium case because we expand about a nonequilibrium steady state 
where entropy is produced at a constant rate rather than about the absolute 
equilibrium state for which entropy has fixed value. The second-order term is of 
the form 


I * JV/o \ o 

Ss i«ai = -^EE[f L ) Sc,Scj< 0. (12.33) 

11 ,= 1 j= 1 \ ac jJ P,T,{c k ^Cj} 


Since we assume that the system is in local equilibrium, the equilibrium 
stability conditions tell us that 6 2 Si oca i will be negative or zero at each point. 

We can now find the time rate of change of the local entropy due to terms 
which are quadratic in the local fluctuations about the steady state. If we take 
the derivative of Eq. (12.33), we find 



^local — 


dV 


N N 

fEE 

L 1 i = 1 M 


(mx ic d M 

Wwfa*,} * j 


(12.34) 


where we have used Maxwell’s relation, Eq. (2.115). We can use Eq. (12.23) to 
write 


88 Ci 

dt 




Q— 1 


(12.35) 



730 


NONEQUILIBRIUM PHASE TRANSITIONS 


Furthermore, if we note that 



(pressure and temperature are assumed to be fixed) and that 

N 

8A a — ^ ^ 

i=l 


we obtain 


d8 2 Si oca \ 

dt 



(12.36) 


(12.37) 


(12.38) 


Thus, the time variation of the second-order contribution to the entropy 
production due to fluctuations about the nonequilibrium steady state is given by 
a product of fluctuations in the current and forces. The function, 8 2 Si oca i, may be 
viewed as a Lyapounov function (cf. Appendix C). If we can show that 
8 2 S i oca i < 0 and d8 2 S[ oca i/dt > 0, then the steady state we are considering is 
stable. Since the condition 8 2 Si oca i < 0 is always true, the condition that must 
be established is d8 2 Si oca i/dt > 0. 

Let us now compare Eq. (12.38) with the excess entropy production, 8 X P. 
The entropy production itself is given by 


P 




(12.39) 


and the contribution to the entropy production due to fluctuations of the 
independent forces about their equilibrium value is 


8 X P=~ 



’l 

T 


a=\ 


(12.40) 


In the steady state, currents corresponding to the independent (unconstrained) 
forces vanish. Therefore, we can make the following expansion of J c a about the 
steady state, 

^ = 0 + <5 J c a + 6 2 J c a + ---. (12.41) 


For very small deviations from the steady state, Eq. (12.40) becomes 


8 X P 


\dv 

J a=l 


dt 


(12.42) 
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If the steady state is to be stable with respect to small fluctuations, then the 
fluctuations must cause the entropy production to increase since the entropy 
production for a stable steady state is minimum. An example of how these 
criteria are used is given in Exercise 12.1. 


■ EXERCISE 12 . 1 . Consider the autocatalytic nonlinear chemical reaction 

B + X^2X. 

*2 

The affinity can be written A = RT\n(k\c\/k 2 CBCx), where c$ (c*) is the 
molar density of chemical B(X). (a) When the molar densities are allowed to 
vary freely, what steady states can occur? Are they stable? What is the state 
of thermodynamic equilibrium? (b) Hold the molar density of B fixed at c B 
j by allowing molecules of B to flow in and out of the system. Determine if the 
| steady states are stable. 

| Answer: 

(a) This chemical reaction is nonlinear because it requires X to produce 
X. The rate of the reaction is 

JC = ■ = h c BCx ~ hc 2 x . (1) 

at 

Steady states, c x = c x and q? = Cb, occur when dcx/dt = 0. There- 
fore there are two possible steady states. They are c x = kiCs/kj and 
cx — cb = 0. The second steady state is uninteresting since there are 
no chemicals in the system. The first steady state, c x = k\Cs/k 2 , is a 
state for which the affinity, A = 0. Therefore, it is a state of 
thermodynamic equilibrium. 

(b) Hold the molar density of B fixed at c B = c° B , by allowing B to flow in 
and out of the system. The steady states are then c x — k\c B /k 2 and 
c x = 0. To determine which is stable, consider a small fluctuation, 
6c x, about the steady state so c x = c x + 6c x . The fluctuation in the 
rate of reaction (to lowest order in 6c x ) is 6J = {k\c° B — 2k2Cx)6c X - 
The fluctuation in the affinity is 6A = RTScx/cx- Therefore, the 

j excess entropy production is 

; d x p = Uj c 6A = - 2hc x )(tcxf. (2) 

r Cx 

Since d x P > 0 for steady state, c x = k\c° B /k 2 , this steady state is 
j stable. Since d x P < 0 for steady state, c x = 0, it is unstable. This j 
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reaction is a little too simple to be very interesting, but it illustrates 
the point that nonlinear chemical reactions can have multiple steady 
states when held out of equilibrium. 


In general, as we increase the affinity of a chemical system in which 
autocatalytic (nonlinear) chemical reactions occur, the system will undergo a 
series of phase transitions. The first transition occurs from the near equilibrium 
steady state to a new state which cannot exist near equilibrium. Then, in 
general, there will be transitions between a whole series of states, each of which 
becomes stable and then unstable as the parameters of the system are changed. 

The possibility of having phase transitions to more ordered states far from 
equilibrium opened a new field of research in the chemical and biological 
sciences. For example, living systems may be thought of as chemical systems 
(although of huge complexity) maintained far from equilibrium. There is now 
some evidence that the formation and maintenance of ordered states far from 
equilibrium in such systems is important for the maintenance of life processes. 
In 1977, Ilya Prigogine received the Nobel prize in chemistry for laying the 
foundations of this field of chemistry and for his many contributions to its 
development. 

In the subsequent sections we shall consider some more interesting chemical 
reactions than those which were considered here. We will generally use the 
linear stability theory discussed in Appendix C, because it is simpler to apply 
than the Lyapounov approach discussed in this section and it is sufficient for the 
purposes of this chapter. However, stability conditions for the reactions 
discussed in the next two sections can be obtained using the methods of this 
section and therefore can be fitted into the framework of thermodynamics. 


12.C. THE SCHLOGL MODEL [4] 

The first nonlinear chemical reaction we will consider was introduced by 
Schlogl [4] and exhibits a phase transition which is analogous to the first-order 
phase transition predicted by the van der Waals equation. The Schlogl model is 
given by the nonlinear reaction scheme 

A + 2X^3X and X^B. (12.43) 

Thermodynamic equilibrium occurs when 

kic e A q {c?f = k 2 (4"f and hc‘ q = k a c‘ q , (12.44) 

where c e A q , Cg q , and are the equilibrium molar densities of molecules A, B, 
and X, respectively. At thermodynamic equilibrium, the ratio of the equilibrium 
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densities, c eq and c B q , is fixed, 

_C_l_k4k2 
Keq ~c e B q hk i 


(12.45) 


and is determined by the rate constants. 

If we now hold ca and eg fixed at some nonequilibrium values, ca — c® and 
c B = c B , and allow c x to vary, the rate equation for c x can be written 


—^ = k x c%c 2 x -k 2 c\-k 3 c x + k A Cg. (12.46) 

The steady states occur when dc x /dt = 0 and therefore for values molar 
density, c x = c x , which satisfy the equation 

c x — ac x + kc x — b = 0, (12.47) 

where a = k\C®/k 2 ,b = k 4 c B /k 2 , and K = k 3 /k 2 . If we fix the ratio 
R = c°/cj] 7^ R eq , the steady state, c x , need not be a state of thermodynamic 
equilibrium. 

Equation (12.47) is a cubic equation, as is the van der Waals equation, and 
has similar properties. In Fig. 12.1 we sketch c x as a function of h for fixed 
values of a and k. In the parameter range b- < b < b + , three steady-state 
solutions exist (they can be realized in nature, only if they are stable). The 
values, b _ and b + are easily found. They are the extrema of the curve, 
b = b(c x ), and are obtained from the roots of the equation 

—r — 3 cl — 2 ac x + k = 0. (12.48) 

dc x 

The two roots, c x ± , of Eq. (12.48) are given by 

c x =|(ai \/a 2 - 3 «). (12.49) 





6 _ 


b 


Fig. 12.1. A sketch of the line of steady- 
state solutions for the Schlogl model. 
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The values, b± = b(c£), can be found from Eq. (12.47). For a > %/3/c, there is 
a range of values of b(b- < b < b + ) for which the system can have three 
different steady states. For a < \/3k, it can have only one steady state for each 
value of b. 

From linear stability theory (cf. Appendix C) it is possible to show that the 
region QP in Fig. 12.1 is unstable while regions P'Q and PQ are stable. We can 
let cx{t ) = cx + 6cx(t), where 6cx(t) is a small fluctuation about the steady 
state, cx . If we then substitute c x (t) into Eq. (12.47) and keep only terms first 
order in 6cx(t), we obtain a linearized equation for 6cx(t). We can solve this 
linearized equation and find that 8cx{t) decays exponentially in regions P'Q and 
PQ in Fig. (12.1), and it grows exponentially in regions PQ. Since fluctuations 
are always present, the system will be driven away from the steady state in the 
region PQ and it cannot be found in nature. 

Let us consider Fig. 12.1 again and move along the steady state from 
point P t to point Q by increasing b. When the state of the system reaches Q, 
it must jump from a steady state with molar density, cf% to a steady state 
with molar density, cf , for values of b>b + . Thus the chemical system 
appears to undergo an abrupt transition at the value b = b+. The system also 
exhibits hysteresis behavior. As we decrease b, we remain on the curve 
PQ until we reach the value b = b-. Then the system undergoes an abrupt 
transition from a steady state with molar density, c£, to a steady state with 
molar density, c£ ' ■ In Fig. 12.2 we have plotted the hysteresis region as a 
function of a and b. Such behavior has been used as a model for explosive 
reactions. 

The rate equation (12.47) is a macroscopic equation for the molar density of 
the molecular species X. It is possible to do a stochastic analysis of this reaction 
and to obtain some of the critical exponents of the reaction [5]. 



Fig. 12.2. Hysteresis region for the Schlogl model plotted as a function of a and b. 
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12.D. THE BRUSSELATOR [2, 3, 6-8] 

Chemical systems which are locally in thermodynamic equilibrium but held far 
from chemical equilibrium (constrained to have a large affinity) can undergo 
phase transitions to new stable states with striking behavior. The new stable 
states may be steady states in which the relative concentrations of the 
constituents vary in space, or they may be spatially homogeneous states in 
which the concentrations of some constituents vary in time (chemical clocks), 
or they may be states with nonlinear traveling waves in the concentrations of 
some constituents. The classic example of such behavior is the Belousov [9]- 
Zhabotinski [10] reaction. This reaction, which is too complicated to discuss in 
detail here (cf. Ref. 11), involves the cerium ion catalyzed oxidation of malonic 
acid by bromate in a sulfuric acid medium. The reaction is nonlinear because it 
contains autocatalytic steps. The system, when it is well-stirred, can behave like 
a chemical clock. That is, there is a periodic change in the concentration of Br- 
and of the relative concentration Ce 4+ /Ce 3+ . The system oscillates between red 
and blue color with a period of the order of a minute. Perhaps the most 
fascinating behavior of this system is the traveling waves in the concentration of 
Br _ and the relative concentration Ce 4+ /Ce 3+ which are observed in shallow 
unstirred dishes (cf. Fig. 12.3). 

The Belousov-Zhabotinski reaction appears to be well-described by a model 
which contains three variable intermediates. This model is called the 
Oregonator and was first introduced by Field and Noyes [11, 12], However, 
the qualitative behavior of the type appearing in the Belousov-Zhabotinski 
reaction also occurs in a simpler model called the Brusselator, first introduced 



Fig. 12.3. Traveling waves in the chemical concentrations for the Belousov- 
Zhabotinski reaction. Reprinted, by permission, from A. T. Winfree, The Timing of 
Biological Clocks (Scientific American Books, 1987). 
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by Prigogine and Lefever [6], which contains two variable intermediates. In this 
section we shall discuss some properties of the Brusselator, using linear stability 
theory (cf. Appendix C). 

12.D.1. The Brusselator — A Nonlinear Chemical Model 

The Brusselator is one of the simplest models of a nonlinear chemical system 
for which the relative concentration of the constituents can oscillate in time, or 
can exhibit nonlinear waves. It has six different components, four of which are 
held fixed and two others whose concentrations can vary in space and time. The 
chemical reaction takes place in four steps and is held far from equilibrium by 
allowing the reactions to go in one direction only. The four steps are 

A^X 

B + X^Y + D, (12.50) 

2X + Y — 3X, 


and 


In practice, A and B are present in excess and D and E are removed as soon as 
they appear. The rate equations for X and Y can be written 


^ = k x c A - (k 2 c B + h)c x + hc\c Y + D\ Vj c* 


(12.51) 


and 


— = k 2 c B c x - k 3 c 2 x c Y + D' 2 Vlc Y . (12.52) 

We have allowed the densities to vary in space and have allowed for the 
possibility of diffusion (Z/j and IX 2 are the coefficie nts of d iffusion). I f we n ow 
introdu ce a ch ange of variables, t = k 2 l! ’, X = \Jk 3 jk 4 c x , Y = y/k^/k^cy , 
A = y/k\k 3 /k\c A , B = (k 2 /k 4 ) c B , and D i = D' i /k 4 , then Eqs. (12.51) and 
(12.52) take the form 

^ = A - (B + l)X + X 2 Y + Di V 2 X (12.53) 


and 


— = BX-X 2 Y + D 2 V 2 r Y. 


(12.54) 
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Equations (12.53) and (12.54) have a spatially uniform steady-state solution of 
the form 


X Q =A and Y 0 = B/A, (12.55) 

which is the continuation far from equilibrium of the steady-state solution 
which occurs at chemical equilibrium when the reverse reactions in Eq. (12.50) 
are allowed to occur. That is, the steady state [Eq. (12.55)] lies on the 
“thermodynamic branch” of steady-state solutions. We wish to look for 
conditions under which the thermodynamic branch becomes unstable and a 
bifurcation (nonequilibrium phase transition) occurs to a state which may 
oscillate in space or time. 

We can use linear stability theory (cf. Appendix C) to look for instabilities in 
the steady-state homogeneous solution to Eq. (12.55). We write X and Fas 

X(r,t) =A + <fc(r, t) ( 1 2.56) 

and 

Y(r,t)=* + by(r,t), (12.57) 


where fix and by are small space and time-dependent perturbations. We shall 
introduce Eqs. (12.56) and (12.57) into Eqs. (12.53) and (12.54) and linearize 
them with respect to 6x(r, t) and <5y(r, t). We then obtain 

— (B — l + D X V \) bx + A 2 by (12.58) 


and 


dby 

~df 


—Bbx + (—A 2 + DyS/l) by. 


(12.59) 


Since Eqs. (12.58) and (12.59) are linear, it is sufficient to consider one Fourier 
component of &c(r, t ) and by(r, t ). Before we proceed further, we must specify 
both the shape of container in which the reaction takes place and the boundary 
conditions on the walls of the container. 


12.D.2. Boundary Conditions 

For simplicity, we shall assume that the reaction takes place in a rectangular 
container with sides of length L x ,Ly, and L z . We shall consider two different 
types of boundary conditions. For Case I, we will hold the concentrations, X and 
X constant on the boundaries. For Case II, we will assume there is no flux of X 
and Y through the boundaries. Let us consider these two cases separately below. 
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Case I Boundary Conditions : Let us assume the concentrations, X and Y are 
constant on the boundaries and take on the values 

Xbound ~ A and Y bound = B / A 

on the boundaries. These boundary conditions can be maintained by allowing a 
flow of X and Y through the boundaries. A given Fourier component of &t(r, t) 
and £y(r, t) can be written 

&c(r, t) = x(k, <jj) sin (k x x) sin (k y y) sin (k z z) (12.60) 

and 

6y( r, t) =y(k, uS) sin (k x x) sin ( k y y ) sin (fc z z)e w(k)f , (12.61) 


where 


n r 7r 


n v 7r „ n z 7r 


k = x + y + z, 
L x ^ Ly y ^ L z 


and n x , n y , and n z can each have values 1 , 2 , . . . , oo. 

Case II Boundary Conditions : Let us assume there is no flux of X and Y 
perpendicular to the boundary surfaces so 

n • V r X = n • V r T = 0 


on the boundaries, where n is normal to the boundaries. A given Fourier 
component of &t(r, t) and <5y(r, t) can now be written 

&c(r, t) =x(k,u;) cos ( k x x ) cos (^y) cos (k z z) (12.62) 

and 

6y(r, t) =y(k,u;) cos (^x) cos (k y y) cos (k z z)e^^ (12.63) 


where the wavevector k is defined as in Case I and n x , n y , and n z can each have 
values 0, 1 , 2 , . . . , oo. 

If we substitute either of the solutions — Eqs. (12.60) and (12.61) or Eqs. 
(12.62) and (12.63) — into Eqs. (12.58) and (12.59), we obtain the following 
matrix equation 


{ u — B T 1 T" D x k 2 

V B 


= 0 . 


-A 2 

cj -j- A 2 + D y k 2 


x 

y 


(12.64) 
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An equation for u;(k) is found by setting the determinant of the 2 x 2 matrix on 
the left equal to zero. We then have 

cu(k) 2 + (Ci - C 2 )u{ k) +A 2 B - CjC 2 = 0, (12.65) 

where 

C\ = B — \ — k^Dx and C 2 =A 2 + k 2 D 2 . (12.66) 

Equation (12.65) has two solutions: 

w*(k) =i{C, - C 2 ± ^/(C, +C 2 ) 2 -4A 2 B}. (12.67) 

From Eq. (12.67), it is easy to see that, depending on the variables D\,D 2 , A, 
and B, the frequency cu(k) can be either complex or real and the real part can be 
either positive or negative. Below we will consider the cases of real frequency 
and complex frequency separately. 


12.D.3. Linear Stability Analysis 

To determine if a given mode is unstable, we must substitute the solutions — 
Eqs. (12.60) and (12.61) or Eqs. (12.62) and (12.63) — into Eqs. (12.58) and 
(12.59) and find an equation for the frequency u;(k). In general, o;(k) can be 
complex. If it is complex, the component of the perturbation Sx( r, t) and <Sy(r, t) 
with wavevector k and frequency tu(k) will oscillate about the steady state 
(12.55) with frequency given by the imaginary part of u;(k). The real part of 
w(k) determines if the steady state is stable with respect to the perturbation. If 
the real part of u>(k) is negative, then the perturbation will decay away and the 
steady state is stable with respect to fluctuations with wavevector k and 
frequency tu(k). However, if the real part of u;(k) is positive, the perturbation 
&*(r, t) and 6y(r, t) grow exponentially. Thus, when the real part of o;(k) is 
positive, a bifurcation or phase transition occurs. Linear stability analysis 
cannot tell us the form of the new state, but it can tell us where the bifurcation 
occurs. 

12.D.2.1. Real Frequency a>(k) 

The frequency u>(k) is real when 

(Ci + C 2 ) 2 - 4A 2 5 > 0 (12.68) 

[cf. Eq. (12.67)] and tu(k) is positive if 


CiC 2 -A 2 B> 0 


(12.69) 
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B 



Fig. 12.4. Linear stability diagram for 
the onset of a time-independent dissipa- 
tive structure. For the case pictured here, 
as B increases, the first bifurcation 
occurs at n = 2. (Based on Ref. 3.) 


or 


B>B n = \+^-A 2 + 


A 2 


D 2 n 2 7t 2 


L 2 + 


~1T~ 


(12.70) 


In this case, there will be no oscillations in time but only in space. The linear 
stability diagram for the onset of time-independent spatial oscillations of the 
concentrations of X and Y is given in Fig. 12.4. The curved line is B n . The 
bifurcation first occurs for B = B c , where B c is the lowest value of B 
corresponding to an integer value of n. Linear stability analysis can only tell us 
that a bifurcation to a new spatially oscillating steady state is possible for a 
particular set of values for the parameters A, B, D \ , D 2 , and L. It cannot give us 
information about the form of the new state. However, for the Brusselator the 
spatially varying states can be obtained numerically. In Fig. 12.5 we give an 
example for a one-dimensional system for the following values of the 
parameters: B — 4.6, A = 2,L = \,D\ = 1.6 x 10 -3 , D 2 = 6.0 x 10~ 3 , and 
fixed boundary conditions X boU nd = A and Tbound = B/A. The figure shows a 
state with oscillations in the concentration of X as a function of position. 

12.D.2.1. Complex Frequency o(k) 

The frequency o;(k) is real when 

(Ci + C 2 f -4A 2 B > 0 (12.71) 

[cf. Eq. (12.67)]. From Eq. (12.71), we can show that tu(k) will be complex if 

£>2-£>i<4-2- ( 12J2) 

n L 7T 

In addition, cu(k) will have a positive real part if Ci — C 2 > 0 or if 
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Fig. 12.5. A steady-state spatial dissipative 
structure for B = 4.6, A = 2, L = 1, 
D\ = 1.6 x 10" 3 , and = 6.0 x 10' 3 . The 
boundary conditions are fixed at X — A and 
Y = B/ A. (Based on Ref. 3.) 


Thus, the curve 

n 2 J2 

B n = A 2 + \ + -jj- (D\ + Di) (12.74) 

denotes the boundary between the region where the steady state [Eq. (12.55)] is 
stable and where it is unstable as a function of n. 

We note that for the Case II boundary conditions we can have n = 0 and, 
therefore, the possibility of spatially homogeneous oscillations in time of the 
relative conentrations of X and Y - — that is, a chemical clock. For all finite values 
of n, we have the possibility of a new state which varies in both space and time 
and therefore may exhibit wave motion. The linear stability diagram for the 
transition to time-dependent states is given in Fig. 1 2.6. In Fig. 1 2.7 we give the 
results of a computer simulation, of a one-dimensional system, for a state which 
oscillates in space and time. The solution is reminiscent of a standing wave on a 
string, but, due to the nonlinearities, it has a much more complicated structure. 
In two dimensions, computer simulations have shown that traveling waves can 
exist which are similar to the waves shown in the Belousov-Zhabotinski 
reaction [Fig. 12.3 (cf. Ref. 3)]. 

Spatially varying steady states, temporally oscillating homogeneous states, 
and nonlinear traveling waves have been observed in chemical systems. These 



Fig. 12.6. Linear stability diagram for the 
onset of time-dependent dissipative struc- 
tures. As B is increased, the first bifurcation 
occurs at n = 0 and yields a spatially 
homogeneous chemical clock. (Based on 
Ref. 3.) 
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Fig. 12.7. Successive time steps for a dissipative structure with fixed concentrations on 
the boundary which exhibits oscillations in both space and time. The parameters used 
are L = 1, A — 2, B = 5.45, D\ — 8 x 10~ 3 , and D 2 = 4 x 10~ 3 . (Based on Ref. 3.) 


states, which only become stable far from equilibrium, have been called 
“dissipative structures” by I. Prigogine [2, 3] because their very existence 
depends upon dissipative processes, such as chemical reactions far from 
equilibrium or diffusion (if spatial structures occur). As we have seen in Section 
12.B, they are maintained by production of entropy and by a related flow of 
energy and matter from the world. It is curious that the type of autocatalytic 
reactions which produce them are abundant in living systems and yet are rare in 
nonliving systems. It is possible that dissipative structures in living systems 
play an important role in the maintenance of life processes [14], 

12.E. THE RAYLEIGH-BENARD INSTABILITY [15-19] 

Nonequilibrium phase transitions are abundant in hydrodynamic systems 
because they are governed by nonlinear equations. For example, if we let fluid 
flow in a pipe, we find for low velocities or high viscosities that the flow is 
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smooth and steady. However, as we increase the velocity or decrease the 
viscosity, we get a transition to turbulent flow. The smooth steady state becomes 
unstable and a turbulent state becomes favored. As another example, let us 
consider a fluid at rest and place it between horizontal parallel plates in the 
gravitational field. If we put a temperature gradient across the plates (with the 
hottest plate below) and slowly increase the temperature gradient, we find that 
at some point the rest state becomes unstable and the fluid breaks into 
convective flow cells which occur periodically in space. In each cell, fluid rises 
in one part of the cell and descends in another part. The circulation of fluid 
repeats itself in each cell. This instability is called the Rayleigh-Benard 
instability and is the one we shall study in this section. We will follow closely 
the classic presentation of Chandrasekhar [15]. 

12.E.1. Hydrodynamic Equations and Boundary Conditions 

Let us first write the hydrodynamic equations in a form which will be useful to 
us. The full nonlinear Navier-Stokes equations can be written 

^ + V r -/9V = 0, (12.75) 

^ + V r ■ (pvv) = -V r /> + pF + r,V 2 r v + « + |tj) V r (V r • v), (12.76) 

and 

— + v r ■ - J V r T) = n : V r v, (12.77) 

where p is the mass/volume, s is the entropy/mass, and the remaining quantities 
have been defined in Section 10.B. We will consider the case of a fluid at 
rest constrained to lie between two infinitely large parallel plates which 
extend in the x- and y directions. The distance between the plates is d. We 
will put a temperature gradient across the plates so that the temperature of 
the bottom plate is greater than that of the top, and we will assume that 
a gravitational field acts in the negative z direction (cf. Fig. 12.8). If we 
have a thin enough fluid layer, any density variations in the system will be 
due primarily to the temperature gradient (as opposed to gravity) and we can 
write 


p = p 0 [l + a P (T 0 -T)\, (12.78) 

where a P = —(1/p) (dp/dT) p is the thermal expansivity. Heating the bottom 
more than the top causes the fluid at the top to be denser and heavier and creates 
the possibility for an instability to occur. 
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Td d 

i- t* 

T 0 0 

Fig. 12.8. Geometical configuration for the Benard problem. 

Let us now write the equations for the steady state in the absence of any 
macroscopic flow; that is, v = 0. We obtain 

V r -pv = 0, (12.79) 

v r P = -pgz, (12.80) 

and 

V 2 r T = 0, (12.81) 

where we have let F = —gz and g is the acceleration of gravity. If we note the 
boundary conditions 7(0) = T 0 and T(d) — Td, we can solve Eqs. (12.80) and 
(12.81) to obtain the steady-state solutions. We find 

T(z) = T 0 -az, (12.82) 

p(z) = PO [1 + a/* (To - T)} = poll + atpaz], (12.83) 

and 

P(z) = Po ~ gpo [z + \ a P az 2 \, (12.84) 

where a is the temperature gradient, a = (T 0 — Td)/d, and P 0 is the pressure at 
z = 0. 

Equations (12.82)-(12.84) are the steady-state solutions in the absence of 
flow. We wish to determine conditions for which they are stable. As usual, 
we will perturb the steady state solutions slightly and study the linearized 
equations for the perturbations. Thus, we write 7’(r, t) = T(z) + 6T(r,t), 
P( r, t) = P(z) + 6P( r, t ), and p( r, t) — p(z) + 6p( r, t). The velocity, v(r, t), is a 
first-order perturbation from equilibrium. Let us further note that the dominant 
contribution to entropy variations will come from temperature variations. Thus, 
we write 6s = (c p /T) 6T, where c p is the constant density specific heat, and we 
have neglected terms proportional to a P . If we substitute these expressions into 
the hydrodynamic equations, and linearize Eqs. (12.75)-( 12.77) in the 
perturbations ST, 6P , 6p, and v, we find 

|ip = -V r -(p(z)v), (12.85) 

a 

— p(z)v = - Vr (P(z) + 6P) - pgz + 7/Vjv + (C + 5 rj) V r (V r • v), 

( 12 . 86 ) 
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and 


A 

(p(z)c P ST) = -V, • [p(z)c,r(z)v - KV r (T(z) + ST)}. (12.87) 

Let us now note that 


Sp( r, t) = —pootp 6T( r, t) (12.88) 

[cf. Eq. (12.78)]. Then from Eq. (12.85) we have V r • v « a P . If we now 
neglect all terms which depend on except those that appear in the term 
involving the external field, we obtain 

V r • v = 0, (12.89) 

d\ 

Po-^ = - V r 6P + Poap STgz + T^VpV, (12.90) 

and 

|-^r = av 2 + -L VIST. (12.91) 

at p 0 Cp 

The above approximation is called the Boussinesq approximation. The 
consistency of the Boussinesq approximation has been demonstrated by 
Mihaljan [18]. Equations (12.89)-(12.91) form the starting point of our linear 
stability analysis. 

We can simplify these equations somewhat through the following steps. We 
first take the curl of Eq. (12.90) to obtain 

f)\7 v \ 

— Q — = gap V r x (STz) + i/,V? (V, x v), (12.92) 

where v t = q/po. We next take the curl of Eq. (12.92) and note that 
Vr x (V r xv) = Vr(V r • v)) — V(!v. Using Eq. (12.89), we then find 

A / AC»t»\ 

- Vjv = -ga P V t + g a P V ] (STz) + i/VJy. (12.93) 

In our stability analysis, we shall be interested in instabilities in the z 
components of velocity. The equation of motion for the z component of velocity 
is given by 


d_ 

dt 


VrV z = gap 


(dP_ 

\d!x 2 dy 2 ) 


8T + i/V 4 v z . 


(12.94) 
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If an instability occurs in which flow develops in the z-direction, then v z z must 
change sign as we move in the x- or y-direction (what goes up must come 
down). 

As for the case of the Brusselator, we must specify boundary conditions 
before we apply the stability analysis. We first note the general boundary 
conditions that 


6T(x, y, 0; t) = 6T(x , y, d ; t) = 0 (12.95) 

and 

v z (x, y, 0; t ) = v z (x , y, d\ t) = 0. (12.96) 

In addition to the above general boundary conditions, we have additional 
constraints on the surfaces at z = 0 and z = d. We can have either rigid surfaces 
for which there can be no tangential components of velocity or smooth surfaces 
where we can have a tangential flow. 


Case /: For a rigid surface we have the boundary conditions 

\(x, y, 0; t) = \(x, y, d\ t) = 0. (12.97) 

Since v = 0 for all x and y, we find dv x /dx = dv y /dy = 0, and thus from Eq. 
(12.89) we have the additional condition dv z dz = 0. 


Case II: On a smooth surface there can be horizontal flow but no transport of 
the horizontal components of velocity in the z direction. Thus, the components 
of the stress tensor satisfy the condition 


tt xz (x, y, 0; t ) = ir xz (x, y, d\ t) = ir yz {x, y, 0; t ) = Tr yz (x , y, d\ t) = 0 


or 


and 



(12.98) 

^ + .0 
,9Z OxJ ^ 

(12.99) 

+ =0 

Oy J bound 

(12.100) 


Since v z = 0 for all x and y on the surface, we have 


dvA = 
Ox J bo Unc j 


W =° 

. / bound 


and 


' dv x \ = 

, ^ / bound 



= 0 

bound 

( 12 . 101 ) 


on the smooth surface. 
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12.E.2. Linear Stability Analysis 

We shall now look for instabilities in v z , as well as for fluctuations in T which 
vary in space in the x and y directions. Since in the linear stability analysis we 
work with linearized equations, we only need to consider one Fourier 
component. Thus, we shall write 

v z (r, t) = £>“' (12.102) 

and 

T(r, t ) = f{zy^ x+k ^ (12.103) 

where V z (z) and t(z) are assumed to depend on k = k x \ + k y j and on u. 
Substitution into Eqs. (12.91) and (12.94) yields 

“ t = av > + ^{£- k2 ) f (i2i04) 

and 

w(^-t 2 )v' 2 = -«o,t 2 f + 1 /(^-t 2 ) V, (12.105) 

From the general boundary conditions we must have T(0) = f(d) =0 and 
V z (0) = V z (d) = 0. On a rigid surface we have 

dVzl = dV z | =Q 

dz l z =o dz \ z =d 

On a smooth surface we have 

<PVz \ _cfV z \ = 

dz 2 | z==0 dz 2 \ z=d 

It is convenient to introduce a change of length scale. We will let = z/d, 
s = ud 2 /u, a — kd, and P = vpoc p /K (P is the Prandl number). Then Eqs. 
(12.104) and (12.105) can be combined and written 

~ a2 ) “ < X 2 ~ 5 ) (^2 _ a -Rs ) = ~ Rq;2 ^(^) 


(12.106) 
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and an identical equation holds for T(z). In Eq. (12.106), 


R = 


gapoc p apd 4 
vK 


(12.107) 


is the Rayleigh number. In Exercise 12.2 we show that for the Rayleigh-Benard 
system the rescaled frequency, s, must be real. 


■ EXERCISE 12.2. Show that s, the rescaled frequency, must be real. 
Answer: We first define two functions, G(&) and F(£l f), such that 

(1) 

and 




Then Eq. (12.106) can be written 
>2 


d 2 


d 2 


d & 2 a s ) [d & 2 a ) Vz ^' 

( 2 ) 


(^2 - « 2 - P *) F W = -R<* 2 V Z (&). 


(3) 


We next multiply by the complex conjugate of F(&) and integrate from 0 
to 1: 




J 


F*V 7 d2£ . 


(4) 


An integration by parts, together with the boundary conditions in Eqs. 
(12.95) and (12.96), yields for the first term on the left 


f 


d 2 

F* — -r Fd% 


d % 2 


f 


1 'i 'L l 

d&\ 


d&. 


(5) 


If we now use the definition of F*{2£) obtained from Eq. (2) and integrate 
by parts to rearrange the right-hand side of Eq. (4), we find, after some 
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algebra, 


HI +« 2 |/f + P*|F| 2 W 


= Ra 2 




dV 7 


( 6 ) 




+ c * 2 l^l 2 


d&. 


The real and imaginary parts of Eq. (6) must vanish separately. Thus, we find 


Im (5) 


[ P|Fj 2 + Ra 2 [ 
Jo Jo 


dV 7 




W\v 7 \ d% 


= o, (7) 


where Im ( s ) is the imaginary part of s. Since the term in square brackets in 
Eq. (7) is positive and finite, the imaginary part of s must vanish. Thus, s is a 
real frequency as long as R and P are positive numbers (R and P must be 
positive from thermodynamic stability considerations). 


The transition from the steady state in Eqs. (12.82)-(12.84) to a new steady 
state (often called a “soft mode” transition when the frequency is real and no 
temporal oscillations are allowed) occurs when 5 = 0. For s < 0, fluctuations 
are damped. For s > 0, they can grow. To find conditions under which the 
transition occurs, we set s = 0 in Eq. (12.106) and find values of R which 
satisfy the equation 


Of - " 2 ) ^w=- r “ 2 ^( 2 ')- ( i2 - io8 > 

Thus the whole problem has been reduced to an eigenvalue problem. In 
Exercise 12.3 we solve this eigenvalue problem for smooth boundary 
conditions. We find that a nonequilibrium phase transition first occurs, as 
Rayleigh number is increased, at Rayleigh number R = R c = 277r 4 /4. At this 
Rayleigh number a mode emerges which has wavevector q = k x x + k y y with 
magnitude q 2 = q 2 — 7T 2 / (2 d 2 ). 


■ EXERCISE 12.3. Consider a Rayleigh-Benard system with smooth 
boundary conditions. Compute the lowest value of the Rayleigh number for 
which an instability can occur. 
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Answer: Boundary conditions for two smooth boundaries are such that 


d 2n ~ \ 

5- V z (&) = 0 

Qgtn z ' > 


for all integer n. Thus, V z {2£) must have the form 

V z {3?) = A sin nn$?, (2) 

where n = 1, 2, . . . , oo. Substitution into Eq. 12.108 leads to the eigenvalue 
equation 

D (»V + o i ) ! 


The smallest value of R for which an instability can occur is for n — 1 or 


(-7T 2 + a 2 ) 3 


If we plot R as a function of a, we find a critical value of a (a is related to 
the size of cells in the x and y directions) for which R takes on its lowest 
value. To find it analytically we must find the position to the minimum of R. 
Thus, we set 

|R =3 (^f_(^+o^ = 0 (5) 

da 2 a 2 a 4 

and obtain 


a c = —j= — 2.22. (6) 

V2 

The critical wavelength (the distance across a cell in the x-y plane) for the 
onset of instabilities is 

A„ = — = 2 (7) 


and the critical Rayleigh number is 


Rc = — 7i = 657.51. 

4 

Note that the critical wavelength depends on the size of the container. 
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The case of two smooth surfaces, while easy to analyze (cf. Exercise 12.3), is 
difficult to realize in experiments. Generally, in experiments one has one 
smooth surface and one rigid surface, or two rigid surfaces. For these more 
realistic boundary conditions the analysis preceeds along similar lines but is 
slightly more complicated [15] and will not be discussed here. For the case of 
one rigid and one free surface, one finds R c = 1100.65 and a c = 2.68. For the 
case of two rigid boundaries, one finds R c = 1707.76 and a c = 3.12. 

Let us now summarize our results. For the region in which the steady state in 
Eqs. (12.82)-(12.84) is stable, the system is homogeneous in the x and y 
directions. However, at the transition point, convection begins and the system 
breaks into cells in the x and y directions. The cells will have a periodicity in the 
x and y directions which is proportional to d, the distance between the plates. 
We expect the cell walls to be vertical and that at the cell walls the normal 
gradient of velocity v z (r, t) is zero. Thus, at the cell walls we have 

(n.V rx v t (r,0)=O, (12.109) 

where V rx = i(d/dx) + }{d/dy) and n is normal to the walls of the cell. 



Fig. 12.9. Benard instability. In a circular covered container, rolls usually form. The 
bright areas represent horizontal motion and the fine white lines are settled aluminum 
powder. (Reprinted, by permission, from E. L. Koschmeider, Adv. Chem. Phys. 26, 177 
(1974).) 
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The simplest type of cell pattern that can develop is that of a roll — that is, a 
cell pattern which depends only on one horizontal coordinate, say x. Then we 
have 


v z (r,f) = V z (z) cos kx, (12.110) 

where k is the wavevector of the rolls. 

The theory we have developed here assumes that the planes are of infinite 
extent in the x and y directions. In practice, the experiments must be done in 
finite containers. Then the cell pattern that emerges is strongly dependent on the 
shape of the container. For rectangular containers and circular covered 
containers, rolls usually form. Square cells can be formed in square containers. 
In uncovered circular containers, where the effects of surface tension are 
important, hexagonal cells usually form [19]. In Figs. 12.9 and 12.10 we give 
some examples of rolls and hexagons, respectively, which have been observed 
experimentally. This transition has also been observed in molecular dynamics 
experiments on a two-dimensional Rayleigh-Benard system [20, 21]. In the 
special topics section we describe the behavior of fluctuations in the 
neighborhood of the transition point. 



Fig. 12.10. Benard instability. In an opened container where the effects of surface 
tension are important hexagonal cells usually form. The dark lines indicate vertical 
motion, and the bright lines indicate predominantly horizontal motion. (Reprinted, by 
permission, from E. L. Koschmeider, Adv. Chem. Phys. 26, 177 (1974).) 
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SPECIAL TOPICS 

^ S12.A. Fluctuations Near a Nonequilibrium Phase Transition 

In both chemical and hydrodynamic systems, as we move away from 
equilibrium, new macroscopic modes (dissipative structures) can suddenly 
appear in a system, leading to dramatic new macroscopic behavior. In the 
Brusselator, chemical clocks, chemical waves, and chemical “crystals” 
(stationary spatially varying chemical concentrations) can emerge far from 
chemical equilibrium. A picture of chemical waves in the Belousov- 
Zhabotinski reaction was shown in Fig. 12.3. 

In the Rayleigh-Benard system, stationary convective spatial structures can 
form. These dissipative structures appear at a critical Rayleigh number in the 
form of a stationary convective mode which dominates the macroscopic 
behavior of the fluid as the Rayleigh number is increased. Some examples of 
such modes were shown in Figs. 12.9 and 12.10. 

Ginzburg-Landau theory has proven to be a very useful tool for the analysis 
of nonequilibrium structures. The reason is that in the neighborhood of the 
critical point, the new modes exist on much slower time scales than order 
modes. Their behavior can be isolated and analyzed independently of the other 
degrees of freedom. Below we will show how this may be done for the lowest 
mode in the Rayleigh-Benard system and for the onset of stationary spatial 
structures in the Brusselator. 


► S12.A.1. Fluctuations in the Rayleigh-Benard System [22-24] 

In the Rayleigh-Benard system, the lowest nonequilibrium mode for the case of 
smooth boundary conditions on the horizontal planes at z = 0 and z — d was 
identified in Exercise (12.3). From linear stability analysis we found that this 
mode had a horizontal velocity variation of the form 

v z (k, t)=A sin ( j) ( 12. 1 1 1 ) 

where A is a constant. It appears at a critical Rayleigh number, R c = 27 7r 4 / 4. 
The critical wavevector in the vertical direction is (k z ) c = p c = ix/d. If we 
denote q 2 = k 2 4- k 2 , then the critical wavevector in the horizontal (x, y ) 
direction is given by q 2 = it 2 fid 2 . One can also introduce a critical total 
wavevector, k 2 = q 2 p 2 = 37r 2 /2 d 2 . 

In the isotropic system, near the critical point, it is possible to obtain an 
expression for the correlation function for the mode that will appear at the 
critical point. We will consider a system with Rayleigh number, R < R c , just 
below the critical value. Even though the system is isotropic, the mode that 
appears at R = R c will begin to dominate the fluctuation spectrum because it 



754 


NONEQUILIBRIUM PHASE TRANSITIONS 


experiences critical slowing down near R = R c . If it is excited, it takes a 
relatively long time to decay away. This fact allows us to isolate the new mode 
and treat the many degrees of freedom of the background fluid as a rapidly 
oscillating noise acting on the new mode. Since fluctuations in the background 
fluid oscillate very rapidly in time, we can use the expressions for 
hydrodynamic random noise correlation functions introduced in Section 
S10.F. The use of the equilibrium noise correlation functions to describe 
nonequilibrium hydrodynamic phenomena has been found to give good 
agreement with numerical simulations of nonequilibrium hydrodynamic 
systems [25]. 

► SI 2. A. 1.1. Equations of Motion 

Let us write the linearized equations for the temperature and velocity variations 
obtained in Section 12.E, but now include the random background contributions 
to the stress tensor and heat current which were introduced in Section S10.F. 
From Eqs. 10.311 and 12.90, we find 

Ay — 

Po-^= - V r 6F + poocpgSTz + rjVl v - V r • S. (12.112) 

From Eqs. (10.310) and (12.91) we find 

Poc p ^ = p 0 c p av z + KV 2 , 6T - V, • g„. (12.1 13) 


We are interested in how the z component of velocity varies in the (x, y ) 
direction. Let us take the curl of Eq. (12.1 12) twice and use the vector identity, 
V r x (Vr x v) = — V^v + V r (V r • v). Then the equation for v z (r, t) takes the 
form 


d _2 d 2 6T _ 4 

Po -Q t Vr V z = PoOpgV t {6T) - p 0 Oipg -7^2 ~ + Wr 

+ Z-(VrX(VrX(V r -S))), 


(12.114) 


Since Eqs. (12.113) and (12.114) are linear equations for 6T and v z , we can 
study each Fourier component separately. Therefore, we let 

v z (r ,t) = V z (k,u)e iq ' r e~ luJt sin (k z z), 

<5r(r,f) = f(k,u)e ,q r e~ luJt sin (k z z), 

S(r, t ) = (S + e ip ‘ z + S_ e~ ip *) e iq ri e~ iuJt , 
and g,(r, t) = (g + <* 2 + g_ e~^ z )e^ e' 1 *, (12.1 15) 

where r ± =xx + yy, S± = S (k=,w), g ± = glk*,^), and k* = q ±p z z- We 
will imagine our system confined to a volume V = 8dL 2 (length 2 L in the x 
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and y directions and 2d in the z direction) with periodic boundary condi- 
tions. The temperature and z component of velocity must be zero at z = 0 and 
z = d. 

Equations (12.113) and (12.114) can be combined and take the form 


"if H 


U t k 2 — 


g&pq 2 \ 
k 2 
Kk 2 


pQCp 


) 


f 


( - [z • (k + x (k + x (k + • S + ))) - z • (k- x (k- x (k- • s-))) \ 


M 2 


PQCp 


g + -k g_] 


) 

(12.116) 


where on the right-hand side we have used the expansion sin (x) = 
(1/2/) (e* — e~ x ). The nature of the macroscopic mode that emerges at the 
critical point is determined by the eigenvalues and eigenvectors of the matrix on 
the left-hand side of Eq. (12.116). We will determine these below. 


► SI 2. A. 1.2. The Critical Eigenmode 

Let us find the eigenvalues and eigenvectors of the matrix, 


v t k 2 

ga P q 2 \ 
k 2 


/V ga :f\ 

k l 

c a 

Kk 2 


aga P d A k 2 

PQCp ) 


a i/R / 


where R = aga P pQC p d A / vK is the Rayleigh number. It is instructive first to 
write this matrix at the critical point where R = R c = 277r 4 /4, q 2 = 
q 2 = 7T 2 lid 2 , and k 2 = k 2 = lit 2 jld 2 . Then 


M c 


(3iW 

2d 2 

\ ~ a 


gap \ 

3 

2 agapd 2 

^ 7) 


The matrix M c has eigenvalues 


A s c = 0 and X} = - 


4aga P d A + 21v 2/ k a 

lSd 2 U t 7T 2 


(12.118) 


(12.119) 


where denotes the eigenvalue of the slow mode at the critical point and X c f 
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denotes the eigenvalue of the fast mode at the critical point. The mode with 
eigenvalue X c s = 0 at the critical point is the mode which emerges and becomes 
macroscopic as the Rayleigh number is increased further. This happens because 
the eigenvalue X s changes from negative to positive at the critical point and for 
R > R c this particular mode can grow exponentially. 

The matrix M c is not self-adjoint. Therefore, its left eigenvectors differ from 
its right eigenvectors. The left and right eigenvectors are easy to find. At the 
critical point, the orthonormalized left and right eigenvectors for the mode with 
eigenvalue X c s are given by 


1, 


3v t 7r 2 
lad 2 


and \J/£ = A 


( gap \ 

3 

3v t 'K 2 
\ lad 1 ) 


respectively, where the normalization constant is given by 

, lad 2 

A = 




(12.120) 


( 12 . 121 ) 


The left and right eigenvectors of the mode with eigenvalue X c f can also be 
found, but since we won’t need them, we won’t write them. 

We can also obtain the eigenvalues and eigenvectors of M near the critical 
point but with R < R c . We will focus on the mode with wavevector 
k z =p z = rc/d. Therefore, we set k z = ir/d in the matrix M. We then find 
the eigenvalues and eigenvectors for noncritical values of q 2 and R. Once 
we have them, we expand about the critical point. That is, we set 
q 2 = ( 'K 2 jld 2 ) (1 + 8q) and R = (277r 2 /4) (1 — <5R) and expand in powers of 
8q and <5R. To lowest nonzero order in 8q and <5R, we find 


Aj = 


"tk 2 

1 + P 


R c — R (q 2 -q 2 ) 2 \ 

Rc + kW c ) + 


(12.122) 


where P = u t poc p fK is the Prandl number. To lowest order in 6q and 8R, we 
approximate the left and right eigenvectors by their values at the critical point. 
These were given in Eq. (12.120). 


► SI 2. A. 1.3. Correlation Function for the Macroscopic Mode 

Let us now isolate the critical eigenmode from the equations of motion. If we 

multiply Eq. (12.116) by the left eigenvector, % c s , we find 


{u) + iX s (q, R))u q (u) 


. 1 
1 Pok 2 


vk 2 

apoCp 


(G+-G-), 


(12.123) 


(S+-S-) 
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where S ± = z ■ (k± x (k* x (k* • S (k^w)))), G ± =k ± ■ g s (k ± , u ;), and 

«q(w) = v z (k,u) + ^ f (k,w) (12.124) 

is the critical eigenmode. The correlation function for the critical eigenmode 
can be written 

<«,, ( Wl ) **(<■*)> = ( j- + . + ^ 

7,2^4 

x Tofe (( G r- G i)( G 2 + - G 2 )) (12.125) 

A) C p 

+ ^i«sr-sr)(s 2 + -s 2 -)> . 


From Eq. (10.349), we find 

(z • (ki x (kt x (ki • S (ki, wi))))z • (k 2 x (k 2 x (k 2 • S (k 2 , u; 2 ))))) 

3 3 3 3 

= ku ( kizk{ } fij t zk] )k 2m (k 2z k 2 n ^n,z^ 2 ) 

i=l j=l m— 1 n= 1 

x (Sy(ki,u;i)S m „ (k 2 ,u; 2 )) 

= -47r^rr 7< 5(k 1 + k 2 )0( Wl + u 2 )k\ {k\ - k] z ). 

(12.126) 

Similarly, from Eq. (10.350), we obtain 


3 3 

(kj • g 5 (ki,o;i)k 2 • g s (k 2 ,u; 2 )) = EE kuk 2 j (&i(ki , ui ) (g sJ ( k 2 , u> 2 )) 

i= 1 7=1 

= -47r^r 2 ^^(k 1 + k 2 )%i + u; 2 ). 

(12.127) 

Note that (G^GJ ) = (G7G2 ) = 0 because 6 (kf 4- k 2 ) = 0 since the argument 
kf + k 2 cannot be zero. Similarly, {S^S 2 ) = (Sj'S^). Therefore, the mode 
correlation function reduces to 


<M q! (u7l)Mq 2 (u; 2 )) = 


(o7i 4- i\ s i) (a ) 2 4- i’A 52 ) 

r ^ 

x i ( W g 2 ) + (°r G 2 + )) 

\ a Po C p 


(12.128) 


1 



((s+s 2 - ) + (srs 2 + )) 
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Use of Eqs. (12.126) and (12.127), then give us the following expression for the 
correlation function for the macroscopic mode for Rayleigh number just below 
the critical Rayleigh number. 


<Mq, (uJi)u.q 2 (<^ 2 )) 


A c <$(qi 4- q 2 )^(^i + ^ 2 ) 

M + Ajj ) 


and 


A c 


8 nk B Tvq 2 c 
Po 



3 k 2 yr 

a 2 c p P 


Comparision of Eqs. (12.129) and (10.324) shows that 


(12.129) 


(12.130) 



dte luJt (Mq, (t)u - q , (0)) = 


A c 

2irV\ sl 



(12.131) 


If we take the inverse Fourier transform of Eq. (12.131), we find 


(0)> =^3- 


due 11 


Aji 


u 2 + Ajj 


47rA, 


(12.132) 


There are several things to notice about Eq. (12.132). First, as we approach the 
critical point, the fluctuations in this mode take longer and longer to die away. 
However, something else happens. This correlation function begins to diverge 
because of the dependence on Aji in the denonimator. Near the critical point the 
linear theory we have been considering breaks down and nonlinear effects must 
be included. 

In Exercise 12.4 we show that a time-dependent Ginzberg-Landau equation 
with a Gaussian free energy can reproduce the correlation function obtained in 
Eq. (12.132). Fluctuations in the unstable mode can be assumed to satisfy a 
time-dependent Ginzburg-Landau equation, 


where 


du q {t) 

dt 


6F 

6u- q (t) 


-f-T7q(0, 


(12.133) 


<77q(a;)77-q (-a;)) 


Ac 
2n ’ 


(12.134) 


and the free energy is F = \ J dqA,y(q)|wq(OI ■ 

If we wish to study behavior of the Rayleigh-Benard system very near the 
critical point, we must include nonlinear terms for the slow mode in the free 
energy [23, 24]. The form of the non-Gaussian corrections to the free energy 
that must be included are determined by the nonlinear terms in the 
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hydrodynamic equations. Inclusion of nonlinear terms in the Ginzburg-Landau 
free energy also is necessary to determine the symmetry properties of the 
emerging convective structures — that is, whether they are hexagons, rolls, and 
so on [26]. Linear stability theory has only given us the magnitude of the critical 
wavevector, q c , but says nothing about its direction. At the level of linear 
stability theory, there is a continuous degeneracy in the direction of q. One 
needs to (a) include nonlinear corrections to the equations of motion in order to 
remove this degeneracy and (b) determine the spatial symmetry of the emerging 
modes. 


■ EXERCISE 12.4. The Fourier amplitude, mr(0 = ^ 2k e ‘ kr «(r, t), of a 
fluctuating mode, u(r,t), in a nonequilibrium system (Vis the volume) is 
assumed to satisfy a time-dependent Ginzburg-Landau equation 


du k (t) 

dt 


£«-k (0) 


+ Vk (t), 


where k is a wavevector, F is a free energy functional, and rjk (t) is a random 
noise. In the Gaussian approximation, the free energy functional can be 
written F(t ) = J dk \ k u k (t)u- k (t), where Ak is a function of k and other 
relevant parameters of the system. The random noise satisfies the conditions 


(Vk(t)) v = 0 and (ffr, (t)rjk 2 (t)) r) = 2?rVr k <5(k, + k 2 ) 6(u\ 4- u%). 

The variational derivative is ( 6u k, (t) / 6u k2 (t)) = <5(ki — k 2 ). Assume that the 
medium is isotropic and that the correlation functions are time translation 
invariant but not time reversal invariant (since we are far from equilibrium). 

J What is the strength, I\, of the random noise [27]? 


J Answer: The equations of motion for w k (r) and w_ k (f) are 

| = -A k u k (t) + 77k(f) and = -A_ k «- k (0 + »?-k(0> 0) 

respectively. The averages evolve as 

j (uk(t)) v = e~ Xkt u k (0) and (K- k (f))„ = e~ x - kt u- k (0). (2) 

i Since the system is assumed isotropic and stationary, the correlation function 
has the property (u(r u h) n(r 2 , t 2 )) = C uu (n - r 2 , t x - t 2 ), and therefore 


(«k, (u\)u k2 (uj 2 )) = 27rV<5(ki + k 2 )^(q;i +u 2 ) 


dte luJlt (M kl (0 M -k,(0)). 

(3) 
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Let C uu (k, t) = («k(0 M -k(0)). Then under time translation invariance we 
obtain 

Cuu(k,t) = («k(0 M -k( 0 )) = (Kk(0)u-k(-0) = c uu{-k , -t). (4) 

The correlation functions can be written 

Cuu{ k,0 = <T Ak '(u k (0)M_k(0)) and C uu (-k,t) = e~ x ~ kt (u- k (0)u k (0)). 

( 5 ) 

Now take the Fourier transform of C uu (k, t): 


,(k,ur) = [ 


— oo 
C 00 


0 

r oo 


dtC uu (k,t) e~ iuJ ‘ 
dtC uu (k, t) e~ iuJt -h 
dtC uu (k,t)e~ ,UJt + 


o 

r oo 


dtC m (k,-t)e +i “ ' (6) 

dtC uu (-k, t) e +ka , 


where in the last term we have used Eq. (4). Next use Eqs. (5) and perform 
the integrations in Eq. (6). We obtain 


r,, , CUM) c„„(-k, 0 ) 

C “ (k ’“ ) “l^TAT + -/tu + A_ k 


(V) 


If we Fourier transform the time dependence in Eqs. (1), we can write the 
following noise correlation function 


(77k(u;)77_k(-u;)) = (iu + A k )(-iu; + A_ k )<«k(w)ii-k(“w)) 
= (Ak + A_k)C MM (k, 0), 

since C UM (k, 0) = C uu (— k,0). Thus, the noise strength is 
r k = (Ak + A_k)C MM (k,0). 


( 8 ) 


( 9 ) 


► SI 2.A.2. Fluctuations in the Brusselator [28, 29] 

Chemical clocks and chemical waves have been realized in the Belousov- 
Zhabotinski reaction (cf. Section 12.D) and have been seen in other 
autocatalytic chemical reactions as well (see, for example, Refs. 30 and 31)- 
Chemical “crystals” have been more difficult to achieve experimentally. They 
were first proposed by A. Turing [32] in 1952 and were finally realized in the 
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(a) 


o>; 

Fig. 12.11. Two-dimensional Turing structures produced with the CIMA chemical 
reaction in a two-dimensional gel. (a) Stripes, (b) hexagons. (Reprinted with permission 
from Nature (Q. Quyang and H. L. Swinney, 352, 610 (1991), MacMillan Magazines 
Limited.) 

laboratory for the first time in 1990 [33]. Chemical “crystals” or Turing 
structures, as they are now generally called, are stationary chemical 
concentration patterns (dissipative structures) which can emerge far from 
chemical equilibrium in nonlinear reaction-diffusion proceses. The reason they 
proved difficult to realize in the laboratory is that they generally require that the 
diffusion coefficients governing the reaction be very different in value [34]. This 
was first achieved using the so-called CIMA reaction (a chlorite-malonic acid 
reaction [30])) in a two-dimensional gel. It has now been well-studied and can 
be made to undergo a transitions to a variety of spatial structures [35]. Two such 




762 


NONEQUILIBRIUM PHASE TRANSITIONS 


structures are shown in Fig. 12.1 1. The Brusselator chemical model discussed in 
Section 12.D can reproduce these structures. Therefore we shall focus on it 
below. 

► S12.A.2.1. Equations of Motion and Stability Analysis 
Let us examine the chemical fluctuations that occur near the critical point for 
the onset of stationary chemical patterns. We will expand Eqs. (12.53) and 
(12.54) about the equilibrium stationary state so X(r, t) = A + 6x(r, t) and 
F(r,r) = B/A + 6y(r,t). If we keep all nonlinear terms in Eqs. (12.53) and 
(12.54) and, in addition, if we add random chemical noise contributions, £*(r, t) 
and £y(r,f), to the equations for concentration fluctuations, we obtain 

r\ C n 

— - — (B — \)8x + A 2 Sy + — ( Sx ) 2 + 2A6x6y + ( 8x) 2 6y + D x V 2 6x + 

ut Pi. 

(12.135) 

and 


= —B6x — A 2 6y — j (6x) 2 — 2A6x6y — (6x) 2 6y + D y V\8x + £ y , 

Ot P. 

(12.136) 

It is the linear part of Eqs. (12.135) and (12.136) that determines the critical 
point. If we Fourier transform both the space and time dependence of Eqs. 
(12.135) and (12.136) and keep only the linear terms, we obtain 



( (B - 1 - k 2 D x ) A 2 \ ( x k (u) \ 

V ~ B -A 2 - k lD y ) \ykH / 


The matrix 




Vly( k ^)/’ 

(12.137) 


(12.138) 


determines whether there are any mo des whi ch become critical. In fact a c ritical 
point occurs for B = B c = (1 + Ay/D x /Djf and k 2 = k 2 = A/ y/D x D y . Then 
for these values of B and k 2 , the matrix M has eigenvalues 


\ s (k c ,B c )=0 and Xf(k c ,B c )=A 



(12.139) 


where A s (k, B) is the eigenvalue of the slow mode and X f (k, B ) is the eigenvalue 
of the fast mode. Note that if D x & D y , then A/ « 0 and no separation of time 



SPECIAL TOPICS: A NONEQUILIBRIUM PHASE TRANSITION 


763 


scales occurs. This is part of the reason why Turing structures took so long to 
observe. For chemical reactions in fluids, all the diffusion coefficients generally 
are of the same order of magnitude. By placing the CIMA reaction in a gel, the 
diffusion coefficients could be changed and separated in value. 

The left eigenstate of the slow mode for B = B c and k = k c is given by 


Xs(kc,B c ) 



(12.140) 


We can also obtain the eigenvalue of the slow mode near the critical point. It is 
given by 


K(k,B) « 2 B c ) + 2 D x D y (k 2 - k]f. (12.141) 

V 

With this information, we can now separate the slow mode from the equations 
of motion and obtain its correlation function near the critical point. 

► SI 2. A. 2. 3. Correlation Function for the Critical Eigenmode 
If we multiply Eq. (12.137) by the eigenvector, Eq. (12.140), we find 

(iu- \ s (k,B))u k (u) = ijkM, (12.142) 

where Mk(u>) is the amplitude of the critical eigenmode, 

«k(w) = ^1 + *kM +yk(u>), (12.143) 

and ffc(u) is the chemical noise it experiences, 

%H= ( 1 +^)l, (k .«)+4 (k ,u; ) . (12.144) 

From Exercise (12.4) the noise satisfies the correlation function 

(lk(u)fj- k {-uj)) = 2\ s (k, B) (wk(0)w_k(0)) . (12.145) 

We see that the equation of motion of the critical eigenmode can we written in 
terms of a time-dependent Ginzburg-Landau equation with a Gaussian free 
energy (cf. Exercise 12.4) with a structure similar to that in Eq. (12.133), but 
with a noise strength given by Eq. (12.145). 

In order to determine the spatial symmetry of the Turing patterns that emerge 
as parameters are varied, it is necessary to go beyond the Gaussian 
approximation to the Ginzburg-Landau free energy and include higher-order 
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Fig. 12.12. Numerical simulation of Turing structures using the Brusselator chemical 
model and parameters of the CIMA reaction in a gel. Note the excellent agreement with 
experimental results in Fig. 12.11. (Reprinted from G. Dewel, et. al. Physica A213, 181 
(1995) with kind permission from Elsevier Science - NL, Sara Burgerhart Straat 25, 
1055 KV Amsterdam, The Netherlands.) 


terms in the free energy. The nature of these higher-order terms is determined 
by the type of nonlinearities that occur in the reaction diffusion equations that 
describe the chemical reaction process. A detailed discussion for the case of the 
Brusselator chemical model can be found in Refs. 28 and 29 (see Fig. 12.12). 

The study of autocatalytic chemical reactions in media such as gels, which 
can change the diffusion coefficients, is an area of growing interest. One of the 
more intriguing recent discoveries was the possibility of forming self- 
replicating spots in such systems [36]. These are circular spots of chemical 
concentration which grow and then subdivide and produce multiple spots. 


► S12.A.3. The Time-Dependent Ginzburg-Landau Equation 

The time-dependent Ginzburg-Landau equation has proven to be a very 
powerful tool for studying the dynamical behavior of systems near their critical 
points, both in equilibrium [37] and out of equilibrium. In this book we have 
focused on its applications to nonequilibrium phase transitions because they are 
more easily visualized. The use of the Ginzburg-Landau equation in both cases 
is based on the fact that at a critical point an unstable mode, u(r,t), emerges 
which has a time scale much slower that those of the rest of the fluid and plays 
the role of an order parameter [in the sections above we have worked with the 
spatial Fourier transform of u( r, f)]. Fluctuations in this unstable mode near the 
critical point obey a Ginzburg-Landau equation such as in Eq. (12.133). The 
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simplest non-Gaussian free energy one might encounter can be written 


F = 




+ \bu*(r) + ^c{Vu{r)) 2 


(12.146) 


where a, b, and c are parameters determined by the physics of the problem. The 
parameter a is an externally controlled parameter which plays the role of the 
temperature. When b = 0, the free energy reduces to the Gaussian approxima- 
tion. As one nears the critical point, order parameter fluctuations begin to 
exhibit critical slowing down and correlation functions begin to exhibit long- 
range spatial order, similar to the mean field behavior discribed in Section 8.B. 
However, very near the critical point, the Gaussian approximation and therefore 
mean field theory breaks down and higher-order contributions ( b ^ 0) to the 
Ginzburg-Landau free energy begin to play a dominant role in the behavior of 
the system. One of the few exceptions to this is the equilibrium superconducting 
phase transition which is well-described by mean field theory. Time-dependent 
Ginzburg-Landau equations have also been used to describe instabilities in the 
electromagnetic modes of lasers [23]. The detailed form of the nonlinear 
corrections to the Gaussian free energy is determined by the nonlinear 
contributions to the dynamical equations which governs the problem at hand. 
For electromagnetic modes of a laser [23] and for the lowest mode of the 
Rayleigh-Benard system [24], Eq. (12.146) has been used. For the Brusselator 
chemical model, cubic terms must also be added to the free energy [28]. 
Variations of Ginzburg-Landau theory have also been used to study problems 
of phase separation [38]. While we can’t go into all these applications here, we 
hope to have conveyed the fact that it is, indeed, a fascinating field of study. 

The Rayleigh-Benard instability is just one of several instabilities that 
emerge in Navier-Stokes fluids out of equilibrium. Flowing fluids exhibit a 
series of transitions which can lead to fully developed turbulence in these 
systems [3]. 
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PROBLEMS 

Problem 12.1. Use linear stability theory to locate the stable and unstable steady-state 
solutions for the Schlogl chemical model. 

Problem 12.2. For the Schlogl chemical model, show that multiple steady states can 
only occur far from thermodynamic equilibrium. 

Problem 12.3. A one-dimensional chemically reacting system has variable constituent, 
X, whose rate equation is linear, 

^ = kc x -DV 2 r c x . 

Assume that the concentration, c x , is held fixed on the boundaries with values c x = c 0 . 
Find the steady-state value of c x as a function of position ( D is the diffusion coefficient). 
Sketch the steady-state solution as a function of position. 

Problem 12.4. Write the rate equations for the Brusselator near equilibrium and find the 
equilibrium values of c x and cy. Find the steady-state solutions for the system near 
equilibrium (assume it is spatially homogeneous) and show that far from equilibrium (A 
and B large and D and E eliminated as soon as they are produced) they reduce to the 
expressions Xq — A and Y 0 = B/A. 

Problem 12.5. Show that the Brusselator admits traveling wave solutions, c x (rO ± vt) 
and cy(rd ± vt), for linear motion on a circle, assuming periodic boundary conditions. 
Find the condition for a bifurcation from the thermodynamic branch to a travelling wave 
solution using linear stability theory. Show that the velocity, v, of the waves is a 
decreasing function of the wave number. 

Problem 12.6. A fluid is constrained between two smooth surfaces a distance d apart in 
a gravitational field with a temperature gradient across the fluid. If a Benard instability 
develops, show that it is possible to have square cells. Find the horizontal width of the 
cells in terms of d. Find an expression for the velocity, v, of the fluid and sketch the flow 
in each cell. Assume that at the cell walls the fluid flows upward and has its maximum 
speed. 

Problem S12.1. Locate the critical point in the Schlogl chemical reaction. Compute the 
correlation function for concentration fluctuations, just “below” the critical point and in 
the Gaussian approximation. Compute the correlation function for the chemical noise. 
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A.l. GENERAL FLUID FLOW 

Let us consider a continuous medium which moves with velocity v(x, t) at time 
t and position x. The position x is measured with respect to a fixed frame of 
reference whose origin lies at O x . In addition, it is convenient to introduce a 
second coordinate system whose origin lies at O z and which moves with the 
fluid. We assume that at time t — 0 the two coordinate systems coincide; and at 
time t = 0 we select a given fluid “particle” (small volume element of the fluid) 
which has position x(0) with respect to frame O x and position z with respect to 
frame O z . Because the coordinate systems coincide at time t = 0, z = x(0). 

Let us assume that both the frame O z and the fluid particle move freely with 
the fluid and that the fluid particle always has position z with respect to the 
frame O z . The position of the fluid particle with respect to the frame O x at time t 
is 


x(0 = x(z,r), (A.l) 

where z = x(0) is the initial position of the particle. Equation (A.l) relates the 
coordinates of the fluid particle as seen by observers in the two frames (ct. 
Fig. A.l). 

We now introduce a density @{x, t ), where &(x, t) could be a probability per 
unit volume, internal energy per unit volume, mass per unit volume, angular 
momentum per unit volume, and so on. At time t = 0, an observer in frame O x 
sees a density ^(x(0), 0) and an observer in frame O z sees a density @'(z, 0) in 
the neighborhood of the fluid particle, where ^(x(0),0) = ^'(z,0). At time t, 
an observer in O x sees a density @(x(z,t),t) and an observer in O z sees a 
density 3>'{z , t) in the neighborhood of the fluid particle, where 
@(x(z,t),t) = 2\z ,t). [Once the density is given in one coordinate system, 
it can be found in the other using Eq. (A.l)]. 

We shall denote the total time rate of change of the density in the 
neighborhood of the fluid particle by d,2fi/ dt. Then 
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*3 



Fig. A.l. Coordinates of a fluid particle at time t. 


However, (dx/dt) z is the velocity of the fluid particle: 



(A.3) 


From Eq. (A.2), we see that d<2>/dt — (83f jdt) z is the time rate of change of 
the density as seen by an observer moving with the fluid particle, and ( d3>/dt) x 
is the time rate of change of the density at a fixed point in space (not moving 
with the fluid). Combining Eqs. (A.2) and (A.3), we may write the total 
derivative d/dt in the form 


d d . . 

5“5 + T - v " (A ' 4) 

Equation (A.4) is often called the convective time derivative. It allows one to 
find the time rate of change at a moving point in terms of coordinates which are 
fixed in space. 

Let us now consider a given set of fluid particles contained in a volume 
element dVo at time t = 0. We wish to follow the same set of fluid particles with 
time and see how their volume changes with time. If we remember that 
x(0) = z, we may write 


dV t = f 


/*i, X 2 , X 3 \ 
\Zl j -Z2; £3 / 


dVo, 


(A.5) 


where 


dVo = dz = dx( 0) 


(A.6) 
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and 


dV t = d\(t). 


(A.7) 


In Eq. (A.5), x\ , X 2 , and x 3 are the components of the vector x(t), and zi,Z 2 , and 
Z 3 are the components of the vector z. The Jacobian is defined as 


/ 



x 2 

Z2 


dxi 

dx\ 

dx\ 

dzi 

dzi 

dz 3 

dx 2 

dx 2 

dx 2 

dzi 

dzi 

dz 3 

dx 3 

dx 3 

dx 3 

dzi 

dZ2 

dz 3 


(A.8) 


and may be evaluated from Eq. (A.l). From Eq. (A.5) we see that the Jacobian 
is a measure of the amount by which the volume of a given set of fluid particles 
changes as a function of time. The change in the Jacobian with time, as seen by 
an observer moving with the volume element, is df /dt. If we note that 


d dxi 
dt dzj 


d dxi 
dzj dt 


dvi 

dzj 


(A.9) 


(z is held constant when the differentiation d/dt is carried out), we may write 


= det 


dt 



(A. 10) 


However, 


dvi 


uvi ^ <9V( 0 x k 
dzj dx k dzj 


(A.l 1 


If we substitute Eq. (A.l 1) into Eq. (A. 10), then, in the first determinant on the 
right-hand side, only the term in the sum with k = 1 is nonzero, because for 
k = 2 or 3 two rows of the determinant become identical and therefore give 
zero. The second and third determinants are only nonzero for k = 2 and 3, 
respectively, We therefore obtain 
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If V x • v = 0, then df /dt = 0 and the size of the volume element remains 
constant with time; that is, the fluid is incompressible. If V x • v ^ 0, the volume 
element may expand and contract with time. 

A.2. GENERAL BALANCE EQUATION 

We now wish to find the equation of motion for an arbitrary density ^(x, t ). Let 
us consider a finite volume element Vo at time t = 0 containing a given set of 
fluid particles. At time t, that same set of fluid particles will occupy a volume 
V{t). The total amount of quantity D contained in V{t ) is found by integrating 
^(x, t) over the volume V(t): 

' f 

D(t)= 2>(x,i)dV t . (A. 13) 

m 

The rate of change of the amount of D in V(t) as seen by an observer moving 
with V(t ) is given by 

d i=iM\ m *' t)dV ’ (A - 14) 

V(f) 

Since the limits of integration depend on time, the derivative cannot be taken 
inside the integral. However, if we write the integral in terms of coordinates at 
time t = 0, we can take the derivative inside: 

?=K1I = 1 1 1 (f ■ ' - + ® d i) dv °- (A,5) 

Vo Vo 

In Eq. (A. 15), f is the Jacobian. Using Eq. (A. 12), we obtain 

S = 1 1 1 (? + • v ) / ^ = J 1 1 (? + • v ) ^ 

Vo V(f) 

(A- 16) 

The total amount of the quantity D in V ( t ) can change if there is a flow of D 
through the sides of V(t ) or a source of D in V(t). We can imagine a flow of D 
through the sides. An example would be the case when 3>{x,t) is the heat 
density at position x and time t, and the surrounding medium is at a different 
temperature. If we let J D represent a current of D through the sides of V(t) and 
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Go represent a source of D inside V(t), then we can write 

'd® 


dD 

dt 


V(t) 


dt 


+ • v dV t = 


a D dV t - 


J D -dS t . (A. 17) 




S(t) 


(The surface area element dS t is assumed to point out of V(t).) If we first 
change the surface integral to a volume integral (J J*^ Jo = V,- 

J D dV t ), then, since V(t) is arbitrary, we may equate integrands to obtain 


d® 

—— = —®V X • v + gd — V x • in- 

dt 


(A. 1 8) 


Equation (A. 18) is one form of the balance equation. If we use the definition of 
the convective time derivative given in Eq. (A.4), we may write the balance 
equation in the more usual form: 


d® 

dt 


— V x • (®\ + J d) + cr D . 


(A. 19) 


The quantity ®\ is a convective current of D. It is a current which occurs 
simply because the fluid is in motion. Equation (A. 18) gives the time rate of 
change of the density in the neighborhood of a point moving with the fluid. 
Equation (A. 19) gives the time rate of change of the density at a fixed point in 
the fluid. 

We can get a better understanding of the meaning of various terms in Eq. 
(A. 19) if we integrate it over a fixed volume, V/, in the fluid. Then 



™ dVf = - 
dt f 


v r 


(Sv + Jo) dV f + 



Gd dVf. 


(A. 20) 


Since the volume V/ is fixed, the limits of integration are time independent and 
the time derivative on the left-hand side can be taken outside the integral. Also, 
we can use Stokes’ theorem to change the volume integral of the divergence on 
the right-hand side to a surface integral. We then obtain 



(A. 21) 


where Sf is the surface of Vf and dS/ is an infinitesimal surface area segment 
whose direction is along the outward normal to the surface Sf. From Eq. (A.21), 
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we see that the rate of change of the amount of D in V/ is due, in part, to a flow 
of D through the surface and to the creation of D in the fluid. 
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SYSTEMS OF IDENTICAL PARTICLES 


The indistinguishability of a system of N identical particles lies in the fact that 
the Hamiltonian and all other physical observables are invariant under 
permutation of their dynamical variables [1]. This symmetry property of the 
Hamiltonian leads to a decomposition of the eigenstates of the Hamiltonian and 
all other states of the system into two types, either symmetric or antisymmetric 
under permutation of the N particles. Particles with symmetric eigenstates are 
bosons. Particles with antisymmetric eigenstates are fermions. Experiment 
shows that systems of particles with integral spin (photons, He 4 , etc.) are 
bosons. Systems of particles with half-integral spin (electrons, protons. He 3 , 
etc.) are fermions. 

In this book, we often consider systems of N identical particles with spin, 
confined to a finite or an infinite volume. We will usually consider Hamiltonians 
which can be decomposed into a kinetic energy and a potential energy, 

N (lW-1) 

= ( B >) 

i=l i< j— 1 

where m is the mass of a particle, p, and q, are the momentum and position 
operators, respectively, for the i th particle, and s, is the spin operator for the i th 
particle. If a particle has spin, then we must specify not only its momentum or 
position, but also the component of its spin along some direction in space (we 
choose the z direction). We will be interested in evaluating traces and 
expectation values in terms of complete sets of symmetized position and 
momentum eigenstates, and we will be interested in the relation between these 
states and the “number” basis. Therefore, in Section B.l we construct complete 
sets of symmetrized momentum and position eigenstates and show how to use 
them. In Section B.2 we show the connection between symmetrized position 
and momentum eigenstates and field operators. 


B.l. POSITION AND MOMENTUM EIGENSTATES 

In this section we derive expressions for the position and momentum 
eigenstates of a free particle and for a particle confined to a box with periodic 
boundary conditions. 
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B.1.1. Free Particle 

Let the ket, |r), denote the right eigenstate of the position operator, q, and let 
the ket, jp), denote the right eigenstate of the momentum operator, p, so that 
q|r) = r|r) and p|p) = p|p). It is often useful to write the momentum operator 
as Tik, where k is the wavevector. Then pjk) = 7ik|k). The left eigenstates of the 
position and momentum operators are found by taking the Hermitian adjoint 
(denoted byt) of these equations (note that (q|r))^ = (r|qt = (r|q, since the 
position operator is Hermitian, qt = q). Thus, the left eigenvalue problem can 
be written (r|q = (r|r and (p|p = (p|p. The left and right eigenvectors are 
orthonormal (r'|r) = ^(r' - r) = 6(x' - x)6(y' - y)6 (z' - z ) and (p'|p) = 
W - p) = 6{p' x - p x ) 8{p' y - p y )6(p' z -p z ) or (k'|k) = 6(k' - k) = 

8(k' x — k x )8(k' y — k y )8(k' z — k z ). The left and right eigenvectors form complete 
sets 

* r 

dr|r)(r| = i and dp|p)(p| = i or ^/k|k)(k| = i, (B.2) 

where the integrations are taken over the entire allowed range of r, p, and k, 
respectively. For a free particle, this is -oo < x < oo, -oo < y < oo, and 
— cxd < z < oo. Also, — oo < p x < oo, — oo < p y < oo, and — oo < p z < oo and 
similarly for k. The momentum operator in the position basis is defined as 


<r'|p|r)=«(r'-r)*V r , (B.3) 

and a function of the momentum operator in the position basis is defined as 

( r 'l/(p)l r ) = <5(r' - r )f(j V r ^ . (B.4) 

Similarly, the position operator in the momentum basis is defined as 

<p'|q|p) = <5(p' - p) ~Y Vp. (B.5) 

and a function of the position operator in the momentum basis is defined as 

(p'l/(r)|p> = -P )/ (-y v p)> (B.6) 

We can compute the momentum eigenstate in the position basis, <^> p (r) = 
(r|p) and <£k(r) = (r|k). It is the solution to the equation 

(r|p|p) = p(r|p) or - V r (r|p) = p(rjp). (B.7) 
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The orthonormalized solution to Eq. (B.7) is 

<r|p> = 0 P (r) = (^) ' exp or <r|k> = *(r) = (~) ' . 

(B.8) 

The normalized momentum eigenstate in the momentum basis is 

(p'Ip) = <Mp') = % - pO or (k'|k) = 0 k (k') = 6 ( k - k'). (B.9) 

Similarly, the normalized position eigenstate in the momentum basis is 

w=^=fc) ,/!ra p(T) 

or (k|r) = 0 r (k) = 

and the normalized position eigenstate in the position basis is 

(r , \r) = Mr , ) = S(r-r'). (B.ll) 



B.1.2. Particle in a Box 

Let us now assume that the particle is confined to a cubic box of volume 
V = L\ and let us assume that the position eigenstates satisfy periodic 
boundary conditions 0k(r) = 0k(*, y, z ) = 0 k(* + n x L, y + n y L, z + n z L), 
where n x , n y , and n z are integers ranging from —00 to 00 (we will just consider 
the wavevector basis here). This will be true if the momentum operator is 
written p = fck, and the wavevector k takes on discrete values 

O'TT 

k = — (n x i + n y j + n z k). (B.12) 

We denote the complete set of orthonormal eigenvectors of k by |k). They obey 
the orthogonality condition (klk 7 ) = 6^ r k ,, where 6^ k ,, is the Kronecker delta 
function. Completenes of the states |r) and jk) gives 


dx 


dy 


dz |r)(r| = 1 and 


E E E ww = i. 

t =— 00 n y =— 00 n z =— 00 


0 


(B.13) 
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The wavevector eigenstate in the position basis is then 

(r|k) = 0k(r) = ^2 eikT - ( B - 14 ) 

The position eigenstate in the wavevector basis, (k|r), is just the Hermitian 
adjoint of (rjk). 


B.2. SYMMETRIZED iV-PARTICLE POSITION AND 
MOMENTUM EIGENSTATES [2, 3] 

We will let |r) = |r, s z ) — |r)|j z ) denote the eigenstates of both the position 
operator q and the z component of spin, s z , and we will let | k) = |k, s z ) denote 
the eigenstate of both the wavevector k and the z component of spin. We shall 
often refer to | k) as a “momentum” eigenstate. 

The position eigenstates of the A-body system may be written as the direct 
product of the position eigenstates of the constituent particles, 


\r a ,r b ,...,ri) = \r a ) x \r b ) 2 x ••• x |r z )^. (B.15) 

On the left-hand side of Eq. (B.15), we use the convention that the positions of 
the particles labeled from 1 to A are ordered from left to right from 1 to A. 
Thus, particle 1 has position r a , particle 2 has position r b , . . ., and particle A 
has position r/. On the right-hand side of Eq. (B.15), the ket, is a 
position eigenstate of particle i. (When we say “position” we include spin if 
the particle has spin.) A “momentum” (and spin) eigenstate of the A-body 
system is written 


kl bl • • • ) kl) I k a ) 1 1^)2 X ' ' ' X |&/) yy. (B. 16) 

where particle 1 has momentum k a , particle 2 has position k b , and so on. 

In the position representation, the Hamiltonian takes the form 



(B- 17) 


For the case when the Hamiltonian (and the other dynamical variables) are 
symmetric under interchange of particles, we must use only symmetric or 
antisymmetric combinations of position or momentum eigenstates when 
computing expectation values. We will describe these states for the cases of 
Bose-Einstein and Fermi-Dirac particles separately. 
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B.2.1. Symmeterized Momentum Eigenstates for 
Bose-Einstein Particles 

By definition, any wave function describing the state of a system containing N 
identical Bose-Einstein particles must be symmetric under interchange of 
coordinates of any two particles. Furthermore, any number of bosons can have 
the same quantum numbers. 

Let us first introduce a permutation operator, Py, such that Py interchanges 
the momenta of particles i and j: 

P ij\ki ,k 2 ,..., ki , kj, ...,K N ) = \ki,k 2 ,..., kj , k u ...,k N ). (B.18) 

We shall use the following notation for the sum of all possible permutations: 

E P\k\,k 2 , . . . , k^f) = E (all AH permutations of 
p (B.19) 

momenta in \ki,k 2 , . . . , k N )). 

For example 

V P\k u k 2 ,k 3 ) ={\k x ,k 2 ,k 3 ) + \k 2 ,k h k 3 ) + |*i,*3,*2> 
p (B.20) 

+ \k 3 ,k 2 ,ki) + \k 3 ,k u k 2 ) + \k 2 ,k 3 ,k x )}. 


Note that in \k 2 ,k 3 ,ki), particle 1 has momentum k 2 , particle 2 has momentum 
k 3 , and particle 3 has momentum k\. 

If in the ket \ki,k 2 , . . . ,kpj) there are n\ particles with momentum k\ : n 2 
particles with momentum k 2 ,n 3 particles with momentum k 3 , and so on, then 
P\ki,k 2 ,k 3 ) contains N\ terms, but only n a \ of them are 

different. In P\ki,k 2 ,k 3 ) each term is repeated n a- times. If we now 
use the orthonormality of momentum states, 

(k a , kb, , ki\k'a,k'b, . . . ,k'i) = h a ,k> a Kr b x • • * x 6 khk >, (B.21) 


we find that 


• • • ,k N ) {S) = 


V N '- U a=i 


E ^l^ 1 ’ • • • ,kn) 


(B.22) 


is a symmetrized orthonormal N-body momentum eigenstate. The symmetrized 
states have the normalization 


^ <* 1 , * 2 , - - - , * 2 , *»> <S) = 1 


(B.23) 



SYMMETRIZED N-PARTICLE POSITION 


779 


and satisfy the completeness relation 

X ( fl n “ ! ) \ki,k 2 ,...,k N ) {s) {s) (ki,k 2 ,...,k N \. (B.24) 

' ki,...,k N \a=l / 

The factor n a \/N\ in the summation comes from the fact that the 

summation produces A^!/n^=i n a- copies of each term. 

B.2.2. Antisymmetrized Momentum Eigenstates for 
Fermi-Dirac Particles 

Any wave function describing the state of a system containing N identical 
Fermi-Dirac particles must be antisymmetric under interchange of coordinates 
of any two particles. Because of the Pauli exclusion principle, no more than one 
fermion can have a given set of quantum numbers. 

An antisymmetrized momentum eigenstate may be written 

1*1 . *2, • ■ • , £ (- 1)' P\kx *AT>, (B.25) 

where ( - 1 ) /> = + 1 for an even number of permutations and ( - 1 ) p = — 1 for an 
odd number of permutations. For example, 

X (~ l ) P p \ k u k 2 ,k 3 ) ={\ki,k 2 ,k 3 ) - \k 2 ,ki,k 3 ) - \k u k 3 ,k 2 ) 
p (B.26) 

- \k3,k 2 ,ki) + \k 3 ,k h k 2 ) + \k 2 ,k 3 ,ki)}. 

The antisymmetrized states have the normalization 

{A) (k,,k 2 k N \k u k 2 ,...,k N ) W = 1 (B.27) 

and satisfy the completeness relation 

E \kuh,...,k N ) w ^(k u k 2 ,...,k N \. (B.28) 

' ki,...,k N 

Note that if any two particles have the same momentum and spin, then the state 
\ki,k 2 , . . . , k N )^ is identically zero. For example, \ki,ki, . . . ,k N )^ = 0. 

The wave function (r \ , r 2 , . . . , ru\k \ , k 2 , . . . , k ^) ^ can be written in the form 
of a determinant, 

(n\k 2 ) ••• (ri|^)\ 

(rilki) ••• (r 2 \k N ) 

; • ) 

(r N \k 2 ) ••• (r N \k N ) / 

(B.29) 

known as the Slater determinant. 


(r u r 2 , . . . , r N \k u k 2 , . . . , k N ) {A) = ~^= 


( W*i) 
{ r i\k\) 

KMki) 
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B.2.3. Partition Functions and Expectation Values 

When evaluating the properties of many-body systems, we must compute 
partition functions and expectation values. Some of these quantities can be 
simplified as we shall show here. 

The trace for an V-particle boson system can be written 

Tr (#v)=^y ^2 \ II " q! ) {S) ( k uk2,---,k N \p N \k u k 2 ,...,k N ) {s) 

k u ...,k N \a=l / 

= (th) ^2 {+) (ki,k 2 ,...,k N \p N \ki,k 2 ,...,k N ) {+) (B.30) 

k u ...,k N 

= 4 ^ (ki,k2,...,k N \p N \k u k 2 ,...,k N ) i+ \ 

ki,...,k N 

where 

| k u k 2 ,...,k N ) M = £ />|*!, * 2 , (B.31) 

P 


Similarly, for an vV-particle fermion system we have 


Tr(pv) = Try ^ ^(&i,&2, • • • ,M/W|£i 5 &2> • • • 

k u ..,k N 

— Try ^2 (^ 1 )^ 2 , • • • ,&Ar|Av|&i, &2, • • • , \ 

ki,...,k N 


where 


\ki,k 2 ,...,k N ) { } = ^2{-l) P P\k\,k 2 ,...,k N ). 

p 


(B.32) 


(B.33) 


It is useful also to look at expectation values. Let consider the W-particle one- 
body operator 

C® = Y o,, (B.34) 

i=l 

where 0, depends only on momentum and coordinate operators for particle t. 
The kinetic energy is an example of such an operator. The expectation value of 
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a one-body operator for a system of N bosons can be written 


(d< 1 >>=Tr A ,(6(, 1) p w ) = -l Y, ffU’ 

' ki,...,k N \a=l , 


x (5) (ki ,k 2 ,..., k N \OW p N \ki ,k 2 ,..., k N ) 

(w) E E E <+) <*> 

V v -/ k/ v ...,k' N i= 1 


|(0 




/ \(+) 


X (+) <*', fc^|p/v|*i k N ) M 

= Af(E) E E (ki,...,k N \di\k' l ,...,k' N ) i+) 

ki,...,k N k v ...,k! N 

x {k [ , . . . , k^\p^\ki , . - . , 

= 7^E E (^1 |Oi 1^) (+) {k[ ,k 2 ,...,k N \p N \ki,..., k N ). 

' '' /P, *1>->*AT 

(B.35) 


We can rewrite (O^) in terms of the reduced probability density (k\ |/5(!) |^i): 


m = ee^i 6 '^) <*iiA»i*i> (b- 36 ) 

k\ k l 


For bosons, the reduced one-body probability density, (^ilp(i)l^i), is defined as 

{JfciOo)!*,) =— L— Y W* k N ). (B.37) 

^ >' k 2 ,...,k N 

For fermions, the reduced one-body probability density is defined as 

{fc'.IPd)!*,) =— -i— Y h) {k\M k„\Mh k N ). (B.38) 

' ' k 2 ,—,kN 

Analogous expressions can be written for two-body operators, but we will leave 
that as an exercise. 


B.3. THE NUMBER REPRESENTATION 

The wavefunctions |&i, . . . , k N ) tell us which momentum state each particle is 
in. However, the states |jfci, . . . ,k N )^ A ^ contain no such information because 
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they contain permutations of particles among all momentum states listed in the 
ket. Also, they can be clumsy to work with. Therefore, it is convenient to 
change representations from one which tells the momentum state of each 
particle (the momentum representation) to one which tells us the number of 
particles in each momentum state (the number representation). 


B.3.1. The Number Representation for Bosons 

The basis states in the number representation are written |«i , n 2 , • • • , «oo), where 
n a is the number of particles in the state of momentum and spin k a . Operators in 
the number representation are written a\ and a * and are called creation and 
annihilation operators, respectively. They are Hermitian conjugates of one 
another. 

The states and operators of the number representation are related to the N- 
particle states |ki, . . . ,k m )^ as follows: 

|ki,...,k m ) (5) = |0,...,ni,...,n m , 0 -> 0) 

= — 7 === K)” K)" ! X X (a£)"-| 0 >, (B. 39 ) 

\f[ n J 

V a=l 

where Yl?=i n ‘ = N. We have labeled the momenta in \k\ , . . . , from 1 to 
m. However, it is understood that there are N positions in the ket. Several 
particles may have the same momentum. The notation 0 — > 0 means that all 
occupation numbers after n m are zero. Note that zeros may be interspersed 
between occupation numbers n \, . . . ,n m . If we interchange momenta of two 
particles, the wavefunction should not change: 


\k\,k 2 ,. . . ,k m ) {s) = \k 2 ,k u . . . ,k m ) {s) 

1 


nj 

a 

_1 

/ 00 

H n a \ 


KT K) ni x • • • x «)"1°> 


H) n ' 'KKHr' x ••• x (a^no). 


Thus, we find that symmetrization of the wavefunctions gives 

/S-L^-L ^4-^4- 


(B.40) 

(B.41) 
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and, therefore, creation operators commute. If we take the Hermitian adjoint of 
Eq. (B.40), we find 

(S) (k\ ,k 2 ,...,k m \ = (5) (k 2 , k\ , . . . , k m \ 

< 0 | (a km rx---x(a k2 r {a k T 


<0| {a km ) nm x ••• x (a k2 ) n2 l a kl a k2 (a kl ) ni *• 


(B.42) 

Thus, annihilation operators also commute: 

a kx a k2 = a kl a kr (B.43) 

The wavefunctions must satisfy the following condition: 

= sr — (0| a*, x ■ • ■ x at.aj x ■ ■ • x a£|0) 

n nj (B.44) 

a 

JO if the sets [k u ... k m ) and [k \ , . . . k' m ] are not the same. 

\ 1 if the sets [&!,... k m \ and [k \ , . . . k' m ] are the same. 

Equation (B.44) will be satisfied if the state |0) and the creation and 
annihilation operators have the following properties: 

<0|0) = i, (B.45) 

a*|0) =0 for all k, (B.46) 

and 

[«*,«?]_ = a k tf, - = 6 kjk >. (B.47) 

One can easily show that Eq. (B.44) follows from Eqs. (B.45)-(B.47) by doing 
some examples. One simply commutes each annihilation operator to the right 
using Eqs. (B.45)-(B.47) as they are needed. The result is Eq. (B.44). 

Again, by using Eqs. (B.39), (B.46), and (B.47), we can show that 

&kj |/ll , . . . , /ly, . . . , /loo) = y/fij 1^1 1 • • • ^oo) i (B.48) 

flj|ni,...,ny,...,ii 00 ) = yjrij + T|/ii, . . . ,/iy + l,...,/ioo), (B.49) 
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and 


(ni , . . . , rij , . . . , Woo \a£.a kj |m , . . . , nj, . . . , n^) = nj. (B.50) 

Equation (B.50) follows from the normalization of the states: 

= 1. (B.51) 

When working with N- body systems, we commonly must evaluate one-body 
operators of the type 


N 

0\ i) = Y (B.52) 

i= 1 

(Note that these operators are symmetric under interchange of particle labels.) 
Some examples are the kinetic energy, the number density, and the particle 
current. We wish to express these operators in the number representation. 

The correspondence between <9^ and its analog in the number representa- 
tion is 

6(1) 6$ = Y ( k a\Oi(quPi,Oi)\k b )ala kh . (B.53) 

k a k.b 

To show that Eq. (B.53) is correct, we must show that 

^(k u ...,k m \d N m \k\,...,k' m ) {S) 

= 0-* 0 10(f) ri m , 0 — » 0). 

We leave this as an exercise. 

In going from the momentum representation to the number representation, 
we thus make the correspondence 

— >0 « m ,0 — 0) (B.55) 

and 

Y i,*i) -^Y( ka \ 0l (^LPi> dl )l^> a *A- ( B - 56 ) 

i=l k a ,kb 

In practice, the only other type of /'/-body operators we encounter are two-body 
operators of the form 

(l/2)N(N-l) 

6(2)= 53 


(B.57) 
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Some examples are the potential energy and the momentum currents (stress 
tensor). Similarly, we can show that 

(l/2)N(N-l) j 

Y °ij(*LiAj,Pi,i>j,Oi,Oj) Y (WOniq^ViWi^Kk'b) 

i<j L kaKWh 

(B.58) 


when going from the coordinate to number representation. 


B.3.2. The Number Representation for Fermions 

The basis states for fermions are also written |nj,/i 2 , . . . ,«oo), but because of 
the Pauli exclusion principle the occupation numbers take on values n a = 0 or 1 
only. We also introduce creation and annihilation operators a£ and a k , 
respectively. But they obey different commutation relations. 

The correspondence between states in the number representation and 
antisymmetrized momentum states is 


|0, . . . , 1] , l 2 ,...,l„,0-*0) = a+a+ X ••• XfljjO) 


= |ki,&2, • • • >^v) 


(A) 


(B.59) 


where we have used the convention n a = \ a when the state k a is filled and zero 
otherwise. In general, there will be zeros interspersed between the occupation 
numbers 1 1 , 12 , . . . , In on the left-hand side of Eq. (B.59). If we interchange 
momenta of two particles, the states must change sign: 

l*i, *2, • • • ,k N ) w = -|i 2 , k u ■ ■ -,kN) W , B _ 

= KK x • • • X a+ 10) = -a+a+ X ... x a+ 10>. 

Thus, the creation operators anticommute, 

KK = -KK- ( B - 61 ) 

If we take the Hermitian adjoint of Eq. (B.60), we find 

a k2 a kl = - a kl a kl . (B.62) 


The annihilation operators also anticommute. 
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The wave functions must satisfy the condition 


(A) 


(k\ , i k^\k 


(A) 


= 


f 0 

+i 


-i 


x • • • x a k at, x • • • x at |0) 

*1 K n ‘ 

if the sets {£,} and {£■} differ, 

if the sets {&,} and {&'} are the same but differ by 

an even permutation, 

if the sets {£,} and {k\} are the same but differ by 
an odd permutation. 


(B.63) 


Equation (B.63) will be satisfied if 


and 


<0|0) = 1, (B.64) 

fl*|0)=0 for all k, (B.65) 


[a k ,a£] + = a k a£, + a%a k = 6 k<k >. (B.66) 


To show Eq. (B.63) for the case of identical states, we use Eq. (B.61) to permute 
operators until 

a*; x ••• = ±a *l * xa ‘»' ( B - 67 ) 

There will be a plus sign if an even number of permutations are needed, and 
there will be a minus sign if an odd number are needed. One can then show that 

x • • - x a kl al x • • • x |0) = 1 (B.68) 

by permuting each annihilation operator all the way to the right using Eqs. 
(B.65) and (B.66). 

For Fermi systems there can be ambiguity in the overall sign of the states. 
Therefore, one uses the convention that in the states |/ii, /i 2 , . . . , /too) the 
occupation numbers are ordered from left to right in order of increasing energy- 
We then find that 


«*;|wt,rt 2 , • ,rtoo) = (-1) E; y/nj \ni,n 2 , . . . , (1 - nf ), . . . ,/ioc), 

(B.69) 

d^\ni , / 12 , , /too) = ( 1) J \/l n j l^i i n 2) • • • ) (1 "F rifi . . . , /ioo)i 

(B.70) 



THE NUMBER REPRESENTATION 


787 


where J2j = ]C/=i n i an d we have used Eqs. (B.61), (B.62), and (B.64)-(B.66). 
In Eqs. (B.69)-(B.71), the occupation numbers may equal 0 or 1. 

For systems of identical fermions, the same correspondence holds between 
operators in the coordinate representation and number representation (cf. Eqs. 
(B.56) and (B.57) as was found to hold for systems of identical bosons. 
However, for fermions the creation and annihilation operators anticommute and 
for bosons they commute. 

To establish this correspondence, we must show that 

w k N \d" m \k\,...X N ) w 

= (0 li, . . . , 1«, o -> OldftlO, . . . , 1,, . . . , 1„, 0 - 0). 

(B.72) 

In Eq. (B.72) we have used the ordering convention discussed above. We will 
assume that the occupation numbers and momenta are labeled in order of 
increasing energy, and that the states, k\ , . . . , k N are all filled. We leave this as 
an exercise. 

A similar correspondence holds for the operator O^y However, we must 
make a brief comment about sign conventions. The correspondence is 

(l/2)N(N-l) 

6 N (2) = Y2 

i i<j (B.73) 

->2 S <^>^l 0 i2(qiq 2 PiP2^i^2 )\k' a ,k' b )a+a+a k >a K . 

W a k' h 

Note that the order of the operators a k ' a and a k ' b is opposite to the order of k' a and 
k' b in the ket \k' a k' b ). The reason is most easily seen by evaluating a matrix 
element of the operator on the right-hand side of Eq. (B.73) with respect to a 
state containing only two particles. We find 

( 0 , . . . , 0 , 1 2 , 0 , . . . , 1 1 , 0 , . . . . . . , 0 , 1 3 , . . . , 1 4 , . . . , 0 ) 

= (0\a k2 a ki a+a+ 10) 

= +^l,a^2,b^3,a'^4,ba' ~ fa, a fa,b fa, a' fa,V ~ fa, a fa,b fa, a’ fap 
+ fa,a^\,bfa,a'^4,b' 


and, hence, 


= {k\k 2 \dx 2 \hU) - {k\k 2 \dn\k4h) . 



788 


SYSTEMS OF IDENTICAL PARTICLES 


Thus, with this convention, the matrix elements whose momenta appear in the 
same order (of increasing or decreasing energy) on each side have a positive 
sign. 


B.3.3. Field Operators 

In Sections D.3.1 and D.3.2, we have written general one- and two-body 
operators in the number representation using momentum and spin as 
coordinates. We can also write them using position and spin as coordinates. 

Let us first consider the single-particle operators in Eq. (B.52). If we insert a 
complete set of position (and spin) states |r) into the matrix element, we find 


0{ f) = 

k a kb 

= [ \ drtdr2(ri\0(q,p,6)\r 2 )i> + (r l )if’{r2) 


(B.74) 


where 


and 


i’(i') = 


• 

k 

Similarly, for two-particle operators we find 

0(2) = 1 h\0(q t , q 2 , Pi, p 2: ff] , k a )d^a ka a tr 


(B.75) 


(B.76) 


k a kbk c kd 


H-I 


dr\ • • • dr4 


x ^^210(4!, q 2 ,p 1 ,p 2 ,Oi,ff2)k3r4)V’ + (ri)^ + (r2)^(r 4 )^(r3). 

(B.77) 

It is easy to show that the momentum-dependent commutation relations 

["k,A,«k',A']± = 5 k,k^A,A' ( B - 78 ) 

are equivalent to position-dependent commutation relations 

[^(r),^(r')] ± = «(r-r')« A , A . (B.79) 

for bosons and fermions (+ for fermions and — for bosons). 
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It is worthwhile to give a few examples of commonly used operators. The 
number operator may be written 


(B.81) 


N 

N " = XI li ^ti nb = Y^ drift (r)^A(r) = XI XI a M a M- ( B - 80 ) 

i= 1 A A k 

The number density operator may be written 

* w (qi. • • • ,q«;R) = E *(4 - *) - MR) = E ^ (R))iMR) 

i= 1 A 

A k,k 2 

The spin density operator is given by 

N 

S^(qi = XI a t 6( % - R) 

i=i 

- MR) = E (A,|«|A 2 )V-+ (R)^(R) 

AiA 2 


k 2 ,A 2 ' 


(B.82) 


The kinetic energy operator is written 

" n 2 B 


_ y' jyi _ y' Y' '+ - 

“ 2m nb ~ S V 2m J 

1=1 A k 


= (r) 


A k 

. + ,_w-* 2 VJ 


2m 


(B.83) 


V'a(t). 


The potential energy operator takes the form 


(l/2)N(N-l) 

v*- E 

'<7=1 

= 9 XI ^ r i^r 2 (ri, Ai; r 2 A 2 | Vi 2 |ri, A 3 ; r 2 , A 4 ) 


Ai,...,A4 
/.+ 


X ^A,( r l)^( r 2)!?A4(r2)^A3(ri) 

= 9 XI (kb A, ; k 2 , A 2 | V|k 3 , A 3 ; fcj, (B.84) 

^ k,,...,k 4 


Ai,...,A 4 
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APPENDIX C 


STABILITY OF SOLUTIONS TO 
NONLINEAR EQUATIONS 


C.l. LINEAR STABILITY THEORY [1-3] 

Any set of nonlinear differential equations of arbitrary order can be reduced to a 
larger number of first-order nonlinear differential equations. Therefore, to study 
the stability of a system, it is sufficient to study the stability of the set of first- 
order nonlinear equations governing the behavior of the system. 

For simplicity, we will consider a system of two nonlinear first-order 
differential equations: 


~^ = y^ =Myu yi) (C-l) 

and 

^ = h=hiyuyi)‘ (c.2) 

If there exist positive numbers k and 6 such that for given values of coordinates 
y® and y® the functions /(yi, >>2) satisfy the condition 

l/Ky?,^) -Myuy2)\^k ^ \yJ-yj\ 

j = 1 

for |y 9 — yj\ < 6 , and if the partial derivatives of fi(y 1^2) exist and are 
continuous, then the functions fi{y\,yi) satisfy Lipschitz conditions and the 
differential equations (C.l) and (C. 2 ) have unique solutions in the neighborhood 
y°i ,^2- We shall always assume that the above conditions are satisfied. 

In general, the set of Eqs. (C.l) and (C. 2 ) will have a number of steady state 
solutions (which are often called fixed or singular or critical points). These are 
solutions y, y2 which satisfy the conditions 


y\ =/i(yr,y2) =o 


(C. 3 ) 
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and 

yi = fi{y 1J2) = o. (c.4) 

We may plot the solutions of Eqs. (C.l) and (C.2) in a two-dimensional phase 
plane. The time-independent steady-state solutions correspond to points in the 
phase plane which do not change their position with time. The time-dependent 
solutions form trajectories in the phase plane. The equation for the trajectories 
can be found by dividing Eq. (C.l) by Eq. (C.2): 

dy\ == /i(yi,y2) , , 

dyi h{y\,yi)‘ 

Equation (C.5) uniquely relates y\ to y 2 for a given value of y 2 . 

In general, a given steady state solution will occur at some point y\ = A and 
y 2 = B in the phase plane. It can always be moved to the origin by the change of 
variables y\ =yi — A and y' 2 =y 2 — B. We therefore wish to study the stability 
of a given steady state solution to Eqs. (C.l) and (C.2) which occurs at the 
origin. 

Consider a system which is described by Eqs. (C.l) and (C.2) and which is in 
the steady state yi, y 2 . We often want to know how the system behaves under 
the influence of a small perturbation. Will the system leave the steady state and 
move to a different state? Will it decay back to the steady state? Or will the 
system remain in the neighborhood of the steady state and not decay back? 

With linear stability theory, we can say something about the stability of a 
very small neighborhood of the steady state. We can say nothing about the 
global stability of the phase space. To examine the stability of solutions in the 
neighborhood of the steady state, let us expand y\ and y 2 about the steady state, 

(C.6) 

and 

y2=y2+*2, (C.7) 

and let us expand both sides of Eqs. (C.l) and (C.2) about the steady state. We 
then find 
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For very small values of jc, (for solutions very close to the steady state), we can 
linearize Eqs. (C.8) and (C.9) and obtain 


Xi 

X2 



where 


(C.10) 


and 



(C.ll) 


a = 



b 




and 



We next transform to normal coordinates of Eq. (C.10). If we find the 
eigenvalues of A and expand Eq. (C.10) in terms of the right eigenvectors of A, 
which we denote z\ and zi (cf. Section 6.C), Eq. (C.10) then takes the form 


z\ 

Z2 




where the eigenvalues are determined by the condition that 


det 


a — X 
c 


b 

d- A 


= 0 


(C- 12) 


(C.13) 


or 


X± = l {(a + d) ± \J (a + d) 2 — 4(ad — be)}. (C.14) 

For a conservative system, which is area-preserving, Eq. (C.14) simplifies 
because det |A| = ad — cb = 1. If the coordinates initially have values z® and 
z\, at a later time they will have values 

z\{t) = e x+t z\ and z 2 {t) = e Xt z° 2 . (C.15) 

The eigenvalues, in general, are complex. We can consider several cases which 
can occur depending on values of the real and imaginary parts of A + and A_: 


(a) Fig. C.la: Both A + and A_ are real and A_ < A + < 0. This is a 
completely stable case. A solution displaced from the steady state will 
decay back to the state, exponentially. 
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Z2 Z 2 22 



Fig. C.l. Different types of fixed points: (a) Both A + and A_ are real and A_ < A. f < 0; 
(b) both A + and A_ are real and 0 < A_ < A + ; (c) both A + and A_ are real and 
A_ < 0 < A f ; (d) both A + and A_ are pure imaginary with A + = i(3 and A_ = -i(3 
(z + = C- + K+ and z~ = C+ - K -); ( e ) both A + and A_ are complex, 
A + = a + i(3, X- = a - i(3, and a < 0; (f) both A + and A_ are complex, A + = a + i/3, 
A - — a = —i/3, and a > 0. 


(b) Fig. C.lb: Both A + and A_ are real and 0 < A_ < A + . This is a 
completely unstable case. A solution displaced from the steady state will 
move away exponentially. 

(c) Fig. C.lc: Both A + and A_ are real and A_ < 0 < A + . This is an unstable 
case. A solution which starts with a nonzero value of zi will move away 
from the steady state. (Note that the trajectories moving along the z f 
axis never reach the steady state. It is a separate solution.) 

(d) Fig. C.ld: Both A + and A_ are pure imaginary; that is, A + = i(3 and 
A_ = —i(3. To find the stability in this case, one must consider nonlinear 
terms. The solution neither decays nor moves away from the steady 
state. It simply oscillates around it. In Fig. C.l d we have let 
Z\ = Ci + Ki and Z2 = Ci - Ki, therefore, 

((A = ( cos (3 sin/? W Ci\ 

\CiJ sin/? cos P)\(2j 

(this is a case of marginal stability). For this case, the fixed point is often 
called a center. 



LIMIT CYCLES 


795 


(e) Fig. C.le: Both A + and A_ are complex, so A +=a + i/3 and 
A_ = a - i/3 where a < 0. This is a stable case in which the solution 
spirals toward the steady state (which is called a focus). 

(f) Fig. C.lf: Both A + and A_ are complex, so that A+ = a -\-i(3 and 
A_ = a — i/3 and a > 0. This is an unstable case in which the solution 
spirals away from the steady state. 

The linear stability analysis provides a very simple tool for studying the 
stability of a steady state. For example, if the real parts of A+ and A_ are both 
negative, the steady state is stable. It can be shown that the nonlinear terms 
which were neglected in the stability analysis do not change this fact (Ref. 1, p. 
32). Similarly, if at least one of the eigenvalues has a positive real part, the 
steady state will be unstable. Inclusion of the nonlinear terms cannot make it 
stable. 

When linear stability analysis reveals a center (d), then one must look at the 
effect of nonlinear terms on the stability. They will, usually (but not always), 
change the center to a stable or unstable focus. 


C.2. LIMIT CYCLES 

In Section C. 1 we studied conditions under which steady-state solutions might 
be stable. We found that if a steady-state solution was stable, then all states in 
the neighborhood of the steady state would decay to it after long times. 

For nonconservative nonlinear equations, it is also possible to find stable 
periodic solutions. Such solutions are called limit cycles. If a periodic solution 
is stable, then all solutions in its neighborhood will decay to it after long times. 


EXERCISE C.l. Show that the set of equations 


dy i = , yi(i -y] -yj) 

* 

admit a limit cycle. 


dy 2 , y 2 (l-yi-A) 

and 17 = ~ yx + / 2 7 " 2 ~ 

dt VAT +y\ 


Answer: If we change to polar coordinates, yi = r cos(0) and y 2 = r sin(0) 
these equations become 


dr 2 a i 

— = 1 - r and — = 1. 
dt dt 

If we integrate the first of Eqs. (1), we find 
dr 


Ro 


1 


rt A 1 . /T+r' 

dt = t = - In 
o 2 


( 1 ) 


(2) 


Ro 
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or 

Ap 2t — 1 

T- (3 > 

where A = (1 +R 0 )/( 1 — Ro). From Eq. (3) we see that regardless of the 
value of R 0 (unless Ro = 1), we have R(t) — » 1. If Ro = 1, the R(t) = 1 for 
all time t. Therefore, Eqs. (a) and (b) admit a periodic solution with radius 
R = 1, and all trajectories decay to it after long times. If a solution starts 
with Ro = 1, then it will keep that value for all times (see the accompanying 
figure). Not all limit cycles are stable. One can find limit cycles for which the 
neighboring trajectories move away in time. 


y 



Not all limit cycles are stable. One can find limit cycles for which the 
neighboring trajectories move away in time. 


C.3. LIAPOUNOV FUNCTIONS AND GLOBAL 
STABILITY 

In Section C.l, we discussed methods for determining the infinitesimal stability 
of stationary solutions. Liapounov introduced a means of determining the 
stability of a finite region of the phase space surrounding a stationary state. 

Let us again consider Eqs. (C. 1) and (C. 2) and assume that a steady state 
occurs at yi =y >2 = 0. If we can find a function V(yi , ^ 2 ) such that for a region 
of space (D : |yi| < h,\y 2 \ < h ) around the steady state, V(yi,y 2 ) > 0 (assume 
V(0, 0) = 0) and dV/dt^O), then the steady state will be stable. In other words, 
if V > 0 (except at the origin) and dV/dt^O, a trajectory which starts in D 
remains in D for all time. 

The region D forms a square with sides 2h centered at the origin. Consider 
two more squares with sides 2s and 26 in D such that 6 < s < h (cf. Fig. C.2). 
Let B h denote the boundary of square 2h, B £ the boundary of square 2s, and B b 
the boundary of square 26. We now let a denote the lower bound of V (yi , >> 2 ) on 
B £ . Thus on B £ ,V^a> 0. 

Let us next consider a point y 0 = (yi( ? o),y 2 (?o)) contained in the square B b . 
at time to and let us follow its motion with time. At time t, the point will have 
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V2 



Fig. C.2. Regions of phase space used 
to prove stability of the steady state if 
Liapounov’s criteria are satisfied. 


coordinates y t = (yi(f),y 2 W)- We shall choose B 6 so that V(yi(fo),;y 2 (fo)) < 
Then since dV/dt^O we must have 

V{yi{t),y 2 {t))^ ^(ji^o),^)) < <*• 

Therefore, for all times t > to, the point y t = (yi (t),y 2 (t)) must remain inside 
the boundary B e and the steady state is stable. 

An identical argument can be made if V(y\ ,>> 2 ) < 0 in D and dV/dt^O. If 
such a function can be found, then again, the steady state in stable. 


■ EXERCISE C.2. Consider a damped harmonic oscillator whose equation 
of motion is d 2 y\/dt 2 + a(dy\/dt) + ufyi =0. (a) Show that the total 
energy, \y\ + \y\ = E, can be used as a Liapounov function for this 
problem, (b) Locate and classify the steady states for the cases a > 0 and 
a — 0 


Answer: 

(a) Introduce the variable y 2 , where dyi/dt = . Then the equation 

for the dampled harmonic oscillator can be written as two first order 
equations 

dyi , dy 2 0 , . 

- = y 2 -ay x and (1) 

(Note that for this example, the exact equations of motion are linear.) 
We may now introduce the Liapounov function 

Fiyun) =iyl + i u; oy 2 i > 0 


(2) 
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for all y\ and y 2 . The time rate of change of the Liapounov function is 


dF _ dF dyi dF dy 2 _ 22 

dt dy\ dt dy 2 dt au} oyi • 


( 3 ) 


(b) The steady state occurs at yi = 0 and y 2 = 0. Since F(y\,y 2 ) > 0 for 
all the space around it and since for a > 0, dF/dt < 0 in all the space 
around it, then for a > 0 the steady state is stable. In fact it is a stable 
focus. When a = 0, the steady state is a center. It is marginally stable. 
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densities (microscopic) 
energy density, 321 
kinetic energy operator, 789 
momentum density, 320 
number density operator, 320, 596, 789 
spin density operator, 789 
density fluctuations (particle number), 
353,559,598 
light scattering, 598-600 
density matrix (see probability density 
operator) 
density of states, 
bogolons, 416 

Debye solid, 366-368, 406-407 
Einstein solid, 367-368 
Fermi surface, 400 
lattice vibrations, 405, 407 
aluminium, 368 

density operator (see probability density 
operator) 

detailed balance, 230, 244-246, 578, 

681 

microscopic, 555 

diamagnetism (superconductor), 119-120 
diatomic molecules, 424 
Dieterici equation of state, 94 
differentials (see exact differential) 
diffusion coefficient (see transport 
coefficients) 
diffusion equation, 

Markov chain, 241 
Brownian particle, 271 
mean free path arguments, 663 
diffusional mobility (see transport 
coefficients) 
dilute solution, 77 
dipole moment density, 595 
direct correlation function, 510-512 
discontinuous systems (membranes), 
605-612 

dispersion relations (see also 

hydrodynamic normal modes) 
lattice vibrations, 402 
sound, 364 

dissipative structures, 721-722, 742 
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distinguishable particles, 347-348, 
362-381 

distribution function (see probability 
distribution function) 
divergences in virial expansion of 
correlation function, 716 
Doppler shift (scattered light), 593 
dyatic tensor, 540 

efficiency of heat engine, 25-26 
eigenvalues, 

Boltsmann collision operator, 685 
Hamiltonian, 522-523 
Lorentz-Boltzmann collision operator, 
686 

nonsymmetric matrix, 235-237 
eigenvectors, (see eigenvalues) 

Einstein diffusion coefficient (see 
transport coefficients) 

Einstein fluctuation theory, 349-354 
fluid systems, 351-354, 560 
Einstein solid, 345-346, 362-364 
entropy, 345-346, 363 
heat capacity, 346, 365 
Helmholtz free energy, 363 
internal energy, 346 
elastic collisions, 670-679 
electric polarization field, 20 
electrical neutrality, 86 
electrochemical potential, 48, 87, 583, 
601, 610 
electrolytes, 

thermodynamic properties, 86-89 
transport, 583-586, 610 
electroosmosis, 611 
electroosmotic pressure — EOP, 611 
endothermic reaction, 83 
energy absorption (see power absorption) 
energy current density (see current) 
energy shell, 346 
energy surface, 298, 341 
invariant area element, 298-299 
ensemble, 296 
enthalpy, 40-42 
ideal gas, 41 
latent heat, 103 
entropy, 29-31, 341 

classical ideal gas (Sackur-Tetrode 
equation), 35, 348, 361 


Einstein solid, 346 
Fermi-Dirac gas, 414 
Fermi superfluid, 414 
fluctuations in isolated systems, 56, 
350, 554 

Gibbs entropy, 342 
//-theorem, 680-682 
hydrodynamic equation (see balance 
equation) 

liquid He 3 , 125-126 
entropy of mixing, 12-1 A 
entropy production, 31, 537 
continuous multicomponent systems, 
574-586 

chemical reaction, 578 
electron thermal and electric 
conduction, 601 
general expression, 577 
ion diffusion and conduction, 

583 

salt-water solution, 585 
solvent-solute mixtures, 580-581 
superfluid, 635 

discontinuous multicomponent systems 
(membranes) 
electrolytes, 610-611 
water and solute, 606 
due to fluctuations, 554 
general expression near equilibrium, 724 
generalized Joule heating, 539 
minimum entropy production, 724 
simple fluid, 539, 543 
e-expansion for S 4 model, 460 
equations of state, 

Berthelot equation, 169 
chemical, 34 
Curies law, 21 
Dieterici equation, 94 
elastic rod, 19 
electric polarization, 20 
general properties, 34 
ideal classical gas, 16 
mechanical, 16-21, 34 
Pade approximate for hard-sphere gas, 
507 

solids, 19 
surface tension, 20 
thermal, 34 

van der Waals equation, 18 
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equations of state ( contd .) 

virial expansion for classical gas, 17, 
498 

virial expansion for quantum gas, 17, 
519 

equilibrium (see thermodynamic 
equilibrium) 

equipartition of energy, 253, 284, 326, 
356, 368 

ergodicity, 286, 300, 349 
classical dynamical systems, 296-303, 
343 

Markov chains, 230 
Wiener-Khinchine theorem, 557-558 
ergodic flow, 296-303, 322 
ergodic theorem, 298, 558 
excape probability, 202 
Euler’s equation (see fundamental 
equation) 

event (in probability theory), 175 
exact differential, 11-16 
exothermic reaction, 83 
extensive variables, 12 
extremum principles, 
enthalpy, 42 

entropy, 31, 57, 344, 354, 377 
free energy, 129 

Gibbs free energy, 46, 80, 106, 117 
Grand potential, 50, 143 
Helmholtz free energy, 44 

Faraday (F), 48 
Fermi-Dirac fluid, 343 
ideal gas, 382, 392-401 
superconductor, 407-417 
Fermi-Dirac statistics, 123, 381-382 
Fermi energy, 393, 398 
Fermi momentum, 393 
Fermi sea, 394 
Fermi surface, 400 
fermions, 357 
Fick’s law, 580-581, 663 
field operators, 788 

first law of thermodynamics (see laws of 
thermodynamics) 

first-order phase transition, 96, 101 
classical fluids, 103-110 
He 3 — He 4 mixture, 127 
He 3 , 126 


He 4 , 124 

superconductor, 121 

first passage time (see mean first passage 
time) 

first sound (see sound) 
fluctuation-dissipation theorem, 532, 562, 
568, 651 

microscopic theory, 589-592 
fluids in equilibrium, 618-620 
fluctuations, 

conditional probability, 553-555 
correlation matrix (dynamic), 553, 557 
density fluctuations, 353, 491-492, 599 
Einstein fluctuation theory, 349-354 
energy fluctuations, 356 
entropy of, 56, 554 
light scattering, 559 
magnetization density, 430 
near critical points, 427 
particle number, 380 
probability distribution, 350, 432, 553 
regression of fluctuations, 555-556 
temperature, 353 
flux (see current) 

Fokker-Planck equation, 230, 266-278 
derivation, 266-270 
strong friction limit, 270-271 
spectral properties, 272-275 
fountain effect, 149 
fourth sound (see sound) 
fractional deviation, 191, 356, 380 
frames of reference, 
center of mass, 672 
laboratory, 632-633 
superflud, rest frame, 632-633 
free energy (see thermodynamic potential) 
Ginzburg-Landau, 129, 431-432 
free expansion, 66 
Frobenius-Peron operator, 324 
fugacity, 383, 388, 393 
fundamental distribution law, 301 
fundamental equation, 33-34 
for ideal gas, 36 
enthalpy, 40 
entropy, 34 
Gibbs free energy, 45 
Grand potential, 48, 378 
Helmholtz free energy, 42, 355 
internal energy, 37, 537 
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fusion curve (see coexistence curve) 

Galilean transformation, 633 
gap function, 409—417 
tin, 417 

gamma distribution function, 228 
gamma function, 218 
gas constant ( R ), 16 
gauge symmetry, 128 
superconductors, 409 
superfluids, 648-649 
gauge transformation, 648 
Gaussian probability density, 191-192, 
208 

Brownian motion 272, 275 
central limit theorem, 198, 213 
Fokker-Planck equation, 272 
Kolmogorov formula, 224 
multivariant, 187, 351, 553 
random walk, 196, 241 
white noise, 252, 268, 571 
Wick’s theorem, 187 
Gaussian model (Ising system), 451 
fixed points, 458 
Gauss’s theorem, 267, 535 
Gegenbauer polynomials, 266 
generalized currents, 554 
generalized displacement, 23, 344, 356, 
378 

generalized forces (hydrodynamic), 
541-554 

generalized forces (thermodynamic), 23, 
344, 356, 378 

generalized Ohm’s law (see Ohm’s law 
(generalized)) 
generating function, 
for random walk, 202 
for Master equation, 262-265 
Gibbs counting factor, 348 
Gibbs-Duhem equation, 35, 154, 352 
superfluid, 633 
Gibbs entropy, 342, 377 
microcanonial ensemble, 344 
Gibbs free energy, 45-48 
binary mixture, 95, 153-159, 169 
chemical reactions, 80-83 
classification of phase transitions, 
100-103 

Ising model, 371 


exact (1-dim), 371 
mean field approximation, 372 
magnetic system, 65 
mixtures, 62-63, 73-78, 170 
stability, 65, 106, 117 
superconductor, 163 
Gibbs paradox, 72, 74, 347-348, 359 
Gibbs phase rule, 98-100 
Ginzburg-Landau theory, 

continuous phase transitions, 128-134 
free energy, 129 
Ising lattice, 431-433 
magnetic system, 133 
superconductor, 162-166 
superfluid, 131 

time-dependent, 722, 764-765 
global stability, 130 
Goldstone bosons, 645-646 
grand canonical ensemble, 342, 377-401 
particle number fluctuations, 380 
relation to canonical ensemble, 380 
reduced distribution function, 509 
grand partition function, 378 
black-body radiation, 418 
Bose-Einstein gas, 381-383 
cluster (Ursell) functions, 493-495 
cumulent expansion, 
classical, 493-495 
quantum, 518 

Fermi-Dirac gas, 381-382, 393 
hard sphere gas, 418 
roots of, 420-422 
grand potential, 48 
black-body radiation, 379 
Bose-Einstein gas, 383, 385 
cluster expansion, 498 
cumulent expansion, 
classical, 48, 498 
quantum, 518 
Fermi-Dirac gas, 393-394 
interacting gas 
classical, 498 
quantum, 518 
liquid droplets, 143 
graphs, 464, 468, 470, 472 

H-theorem, 680-682 
Hamiltonian, 287 

anharmonic system, 326 
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Hamiltonian ( contd .) 
canonical ensemble, 354-356 
classical interacting gas, 489 
Einstein solid, 363 
Fermi-superfluid, 408, 411 
harmonic lattice (1-dim), 401-403 
harmonic oscillator system (quantum), 
306 

Henon-Heiles system, 326 
ideal gas, 358 

internal degrees of freedom, 360 
Ising system, 369, 371, 437, 463 
magnetic system, 309, 337, 429 
particle in gravitational field, 292 
phonons, 364, 403 
spin blocks, 437, 441 
two-body interactions, 672-673 
Walker-Ford system, 331-334 
Hamilton’s equations, 287, 297, 673 
hard-core potential, 419-420 
hard sphere gas, 620 
heat capacity, 50-53 
aluminium, 367 

Bose-Einstein ideal gas, 390, 391 
classical solid, 367 
copper, 367 
at critical point 
Ising mean field, 373 
Ising (2-d), 484 
magnetic solid, 151 
PVT system, 138 
van der Waals gas, 141 
Debye solid, 366-367 
Einstein solid, 346, 367 
Fermi-Dirac gas, 399 
Fermi superfluid, 415-417 
Helmholtz free energy, 42-44 
He 4 , 125 
ideal gas, 44 
Ising model, 

mean field approximation, 375 
exact (2-d), 483 
liquid He 4 , 125 

liquid-vapor coexistence region, 
113-114 

lattice vibrations, 404-405 
magnetic gas, 363 
Rb 2 CoF 4 (Ising-like), 484 
stability of matter, 64 


heat current (random), 612 
heat engine, ( see Carnot engine) 
heat of reaction, 83 
Heaviside function, 178 
Heisenberg picture, 295, 305 
helium liquids, 123-126 
He 3 , 123 

He 4 , 123-124, 146-149 
He I, 124 
He II, 124 

Helmholtz free energy, 42-44 
canonical ensemble, 355 
classical ideal gas, 361 
Einstein solid, 363 
Ising model (2-d), 483 
superconductor, 162 
partition function, 355 
Henon-Heiles system, 326-328 
Hermite polynomials, 274 
Hertz potential, 595-596 
Hessian, 329 

Hittorf transference number, 652 
homogeneous functions, 433-434 
entropy, 33 

free energies, 427, 434 
power law dependence, 434 
Hook’s law, 19 

hydraulic conductance (see transport 
coefficients) 

hydrodynamic equations (see also balance 
equation) 

broken symmetries, 644-649 
conserved quantities, 642-644 
diffusion equation, 271 
derived from Boltzmann equation, 695 
derived from Loren tz-Boltzmann 
equation, 689 

general definition, 534, 552, 639-649 
linearized, 545-552, 613-614, 

621-624 

Navier-Stokes equations, 543, 743 
stochastic, 612-614 
superfluid, 631-639 
hydrodynamic normal modes, 549, 
550-552, 552 

broken symmetries, 644-647 
conserved quantities, 642-644 
dispersion relations (hydrodynamic) 
longitudinal modes, 551 
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transverse modes, 549 
first sound (superfluid), 637-638 
second sound (superfluid), 638 
fourth sound (superfluid), 638, 654 
sound modes, 546 
dispersion relations (microscopic) 
diffusion, 690-691 
entropy modes, 700 
shear modes, 700 
general definition, 639-649 
lifetime, 549, 551, 646 
light scattering experiments, 599-600 
superfluid, 635-636 
hydrostatic pressure, 536 
hyperbolic fixed point, 443 
hysteresis in chemical systems, 734 

ice point of water, 28 
ideal fluid (hydrodynamic), 
classical, 535-536, 539, 546 
superfluid, 635-639 
ideal gas, 
classical, 

Carnot engine, 28 
enthalpy, 41 

equation of state, 16, 391 
Helmholtz free energy, 44 
microcanonical ensemble, 348 
thermodynamic properties, 28-29, 
35, 41, 44, 54-55, 348-349, 
360-361 

quantum, 381—401 
Bose-Einstein, 382-392 
Fermi-Dirac, 381, 392-401 
ideal mixture, 581 
incompressible fluid, 
phase space, 289, 291 
hydrodynamic, 622 
independent events, 177, 185 
independent stochastic variables, 233 
indistinguishable particles, 347-348, 
356-362, 381 
inflection point, 115 

infinitely divisible distribution, 207-211, 
221-224 
Cauchy, 209 
Gaussian, 208 
Levy, 210 
Poisson, 209 


insulating wall, 21 
integrals of motion, 297 
intensive variables, 12 
internal energy, 37-40 
classical interacting fluid, 490 
Einstein solid, 346 
Fermi-Dirac gas, 399 
ideal gas, 36 

Ising model (2-d), 483-484 
magnetic gas, 362 
phonons, 365, 404 
intersection of events, 176 
irreversibility, 334 
Ising system, 369, 377 
critical exponents, 461 
exact solution, 

1 - dimension, 370-372 

2- dimensions, 478, 462-485 
Gaussian model, 451 

mean field approximation, 372-377 
S 4 model, 448-162 
Yang-Lee theory, 421 
isolated system, 22, 342 
isothermal compressibility, {see 
compressibility) 
isothermal process, 24, 104 
van der Waals gas, 116 
vapor-liquid transition, 104, 111 
isothermal susceptibility, 53 
isotropic fluid, 534, 543 

Jacobian of transformation, 289-290 
joint probability density, 184, 231-232 
Joule coefficient, 67 
Dieterici equation, 94 
ideal gas, 67 

van der Waals gas, 67-68 
Joule effect, 10, 66-68 
Joule heating, 533, 539, 554 
Joule-Kelvin coefficient, 70 
ideal gas, 70 
van der Waals gas, 70 
Joule-Kelvin effect, 68-72 
inversion curve, 71 

Kadanoff scaling exponents (u, rf), 
427—128 

KAM theorem, 326-331 
Kelvin temperature scale, 28 
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kinetic equation, 
classical, 313 
quantum, 317 

Kolmogorov formula, 213, 221, 223-224 
Kramers-Kronig relations, 564-565, 571, 
592 

Kramers-Moyal expansion, 276-278 

laboratory frame of reference, 632-633, 
672-677 

Lagrange multipliers, 344, 346, 354-355, 
377-378 
A-line 

in He 4 , 124 

in He 3 — He 4 mixtures, 127 
A-point, 

Fermi superfluid, 416-417 
He 4 , 125 

magnetic systems, 134 
nickel, 132 
superconductor, 122 
Ginsberg-Landau theory, 131-132 
Langevin equation of motion, 230, 
251-254 

harmonically bound Brownian particle, 
571, 651 

entropy current, 612 
momentum current, 612 
Brownian particle with memory, 625 
Laplace transform, 548, 616, 626 
latent heat, 13, 106 
binary mixture, 162 
fusion, 108 
sublimation, 108 
superconductor, 121 
vaporization, 109 

lattice vibrations, 345-346, 362-368 
1 -dimensional lattice, 401-407 
law of corresponding states, 111 
van der Waals system, 116 
law of large numbers, 198-199 
law of mass action, 82 
laws of thermodynamics, 
first law, 9, 22-23, 34 
second law, 9, 23, 34 
third law, 31-33 
Bose-Einstein gas, 391 
Fermi-Dirac gas, 399 
helium, 123, 125 


superconductor, 122 
zeroth law, 9, 20 
Le Chatelier’s principle, 61 
Legendre transformation, 40 
Lennard-Jones 6-12 potential, 500-505, 
511 

lever rule, 1 1 1 

Levy distribution, 210, 214, 218 
Levy flight, 218-221 
Levy-Khintchine formula, 221-222 
light scattering, 532, 592-600 
experiment, 600 
intensity, 599 
limit cycles, 795 
Limit Theorems, 173, 207 
Central Limit Theorem, 197-198, 
211-214 

linear response theory (dynamic), 427, 

561- 574 

Brownian particle, 571-574 
dynamic response function-matrix, 

562- 571 

fluctuation-dissipation theorem, 
568-571 

light scattering, 592-600 
microscopic theory, 589-592 
linearized hydrodynamic equations 
simple fluid, 545-552 
with noise, 613-614 
general form, 556 
flow around Brownian particle, 
621-624 

Liouville equation, 286 
classical, 291-292 
quantum, 305 
Liouville operator, 
classical, 291-292, 312 
quantum, 305 
Lipschitz conditions, 791 
liquid crystal, 168 
liquid He 3 ( see helium liquids) 
liquid He 4 (see helium liquids) 
liquid-solid transition, 104 
liquid- vapor transition, 104 
liquification of gases, 66-72 
local equilibrium, 537, 656 
long range order, 431 
long time tails, 533, 653 
experimental observation, 629-630 
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hydrodynamic origin, 620-631 
molecular dynamics, 620-621, 
710-711 

Lorentz-Boltzmann collision operator, 

686 

eigenfunctions, 690-691 
eigenvalues, 686, 690-691 
Lorentz-Boltzmann equation, 684 
derivation of diffusion coefficient, 
690-691 

derivation of diffusion equation, 688 
dispertion relation of diffusion modes, 
689 

perturbation expansion, 690-691 
Lorentzian line shape, 561 
Lyapounov function, 730 

magnetic field strength, 21 
magnetic induction, 21 
magnetic susceptibility, 151, 429-431 
Ising model (mean field 
approximation), 376 
magnetization, 133, 429 
Ising model (mean field 
approximation), 373-374 
magnetic gas, 362 
magnetization density operator, 429 
Markov chains, 229, 234-241 
random walk, 240-241 
time periodic, 258-259 
Markov processes, 229, 233 
discrete variables, 234-241 
ergodic, 238 
regular, 237 
stationary, 232 

master equation, 229, 241-250 
approximation to 276-278 
birth-death processes, 260-266 
detailed balance, 244-245 
Kramers-Moyal expansion, 276-278 
mean first passage time, 247-250 
Maxwell-Boltzmann distribution, 
657-658 

Maxwell construction, 117-118 
Maxwell relations, 
internal energy, 38 
enthalpy, 41 
Gibbs free energy, 45 
Grand potential, 49 


Helmholtz free energy, 43 
mean field approximation, 
correlation function 
fluid system, 351-353 
magnetic system, 431-432 
Ising model, 372-377 
superconductor, 408-417 
mean field theories, 128-135, 372-377, 
408-417 

(see also Ginzburg-Landau theory) 
mean first passage time, 247-249 
mean free path, 658-659, 661 
mechanical equilibrium, 57, 142 
with surface tension, 143 
mechanical stability, 61, 111 
van der Waals gas, 116-117 
median (of probability distribution), 181 
membranes, 75-78, 88-89, 606-612, 

653 

volume flow (water and solute), 
606-612 

reflection coefficient (semipermeable 
membrane), 609 
ion transport, 610-612 
memory function (matrix), 642 
transport coefficients, 643 
metastable state, 61, 111, 157 
binary mixture, 
classical, 157 
He 3 — He 4 , 127 
van der Waals gas, 115-117 
metric transitivity, 300 
microcanonical ensemble, 301, 342-354 
entropy, 344 

thermodynamic relations, 344 
microscopic reversibility, 553-555 
minimum entropy production, 721, 724 
mixing flow, 296, 303, 321-325 
mixtures, 62-63, 72-77, 88-89, 95 
transport properties, 574-586 
molecular chaos, 680 
molecular dynamics experiments 
equation of state, 506 
radial distribution function, 511 
velocity auto correlation function, 620, 
621, 631, 710-711 
mole fraction, 73 

moments (of stochastic variable), 180-181 
binomial distribution, 190 
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momentum condensation, 

Bose-Einstein gas, 388 
Fermi-Dirac fluid (superconductor), 
407-417, 

momentum current tensor (see current) 
momentum eigenstates, 
antisymmetrized, 357, 520-521 
symmetrized, 357, 520-521 
momentum flux (see momentum current) 
momentum operator, 775 
most probable value (of stochastic 
variable), 181 
multinomial theorem, 174 
multiplication principle, 174 
mutually exclusive events, 176 

natural boundary, 262, 265 
Navier-Stokes equations, 543-549, 743 
linearized, 545-549, 621, 692-695 
neutron scattering, 368, 490-492, 593 
Nemst potential, 89 
Newtonian dynamics, 334-335 
Noble Prize, 126, 408, 556, 732 
nonequilibrium phase transition, 721 
Rayleigh Benard, 749 
Schlogl chemical model, 734 
Brusselator chemical model, 740-742 
fluctuations near critical point, 753-765 
nonequilibrium thermodynamics, 47, 
722-726 

nonlinear chemical reactions (see 
autocatalytic chemical reactions) 
nonlinear equations, 721 
nonlinear waves, 735-742 
nth-order phase transition, 101 
number density (see particle density) 
number operator, 363, 381, 409 
number representation, 357 

Ohm’s law, 118, 539 
Ohm’s law (generalized), 
chemical reaction, 579 
electrolyte, 585, 606-611, 652 
electron thermal and electrical 
conduction, 601 

ion diffusion and conduction, 583-584 
momentum correlation 
transverse, 543 
longitudinal, 543 


random, 612 

solvent-solute diffusion, 581 
superfluid, 635 
thermal conduction, 543 
random, 612 

thermal diffusion (binary mixture), 582 
thermocouple, 653 

Onsager’s relations, 531-532, 552, 556, 
725 

chemical reaction, 579, 580 
electrolyte, 610-612 
experimental test, 612 
electron thermal and electron 
conductivity, 601 
experimental test, 604-605 
general proof, 553-556 
magnetic systems, 586-589 
rotating systems, 586-589 
superfluids, 635 
thermocouple, 654 
water-solute transport across 
membrane, 609 
open system, 22, 49, 342, 377 
order-disorder transitions, 369-377 
information storage, 462 
Ising model, 369-377 
lattice gas, 369 
learning, 462 
binary alloy, 369 
DNA, 369, 462 
order parameter, 97 
Bose-Einstein ideal gas, 389 
broken symmetry, 645-649 
Ising model, 372-374 
magnetic systems, 373-374, 647-648 
nonequilibrium phase transition, 721 
superconductor, 409 
superfluid, 648-649 
Ginzberg-Landau theory, 128-135, 
162-166 
Oregonator, 735 

Omstein-Zemicke equation, 510-511 
orthogonal transformation 
lattice vibrations, 403 
multivariant Gaussian, 187 
osmosis, 74 
electroosmosis, 611 

osmotic flow (see transport coefficients) 
osmotic pressure, 74-78, 608 
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Pade approximate, 507 
paramagnetic system, 21, 369-372 
partial enthalpy, 47, 160 
partial entropy, 47, 160, 575 
partial Gibbs free energy, 47, 160, 575 
partial volume, 47, 77, 160 
partial pressure, 72 
partial wave expansion, 523-524 
particle current density (see current) 
particle number 

Bose-Einstein gas, 383, 385, 388 
Fermi-Dirac gas, 393-394, 399-400 
fluctuations, 380 
Bose-Einstein gas, 392 
Fermi-Dirac gas, 399-400 
partition coefficient, 89 
partition function, 

canonical ensemble, 355 
classical ideal gas, 360 
Debye solid, 365 
Einstein solid, 363 
Ising model, 369 

1 - dimension, 370-371 

2- dimension, 463-481 

mean field approximation, 372 
magnetic gas, 361 
mean field, 431 
phonons, 365, 404 
spin blocks, 441 
S 4 model, 449 

partition of sample space, 176 
Pauli exclusion principle, 382 
Peltier effect, 600-603 
Peltier heat, 601-603, 654 
penetration depth in superconductors, 
165-166 

periodic boundary conditions, 381 
Percus-Yevick equation, 511-513 
comparison to experiment, 513 
equation of state, 512 
permeability, 120, 606-612 
permutation, 174 
permutation operator, 
persistent random walk, 207, 216 
perturbation expansions (Rayleigh- 
Schrodinger), 
phase diagrams 
argon, 513 

binary mixture, 157-158 


Bose-Einstein gas, 390 
He 3 , 125-126 
He 4 , 124 

He 3 — He 4 mixtures, 127 
magnetic systems, 150 
PVT systems, 104 
superconductor, 121 
water, 99 

phase functions, 294, 311, 489, 509 
particle density, 295 
phase space, 

Brownian particle, 266-270 
Hamiltonian, 285, 297 
phase transitions, 
equilibrium, 96-170 
binary mixture, 157 
Bose-Einstein gas, 383 
classical fluid, 103-114 
continuous (see continuous phase 
transition) 

He 3 , 124-126 
He 4 , 123-124 

He 3 — He 4 mixture, 126-127 
Ising model, 372-377, 407-417 
superconductor, 118-122 
PVT system, 103-114 
van der Waals gas, 115-118 
Yang-Lee theory, 418-422 
nonequilibrium 

Brusselator, 735-736 
Rayleigh-Benard instability, 742-452 
Schlogl model, 732-734 
phonons, 379 
photons, 364, 403 
Planck’s constant (h = 2 tyH), 342 
Planck’s formula, 365, 404 
Poisson bracket, 291 
Poisson distribution, 192-194, 209 
polarization field (electric), 20 
polar vector, 543 
position operator, 775 
positive definite matrix, 350 
potentials (see thermodynamic potentials) 
power absorption from external field, 

570 

Brownian particle, 571-574 
delta function force, 570 
oscillating force, 570-571 
Prandl number, 747, 756 
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pressure, 
argon, 513 

black-body radiation, 92, 380 
Bose-Einstein gas, 385, 387, 390, 396 
cluster expansion for classical gas, 498 
Fermi-Dirac gas, 396 
ideal classical gas, 16, 391, 396 
Pade approximate, 507 
Percus-Yevick, 512-513 
radial distribution function, 508 
van der Waals fluid, 18 
viral expansion, 17 
classical, 498 
quantum, 519 
Yang-Lee model, 422 
pressure equation, 508-509, 512 
pressure tensor, 535, 693-694 
probability (definition), 175, 198 
probability current, 270 
probability density function, 178 
classical phase space, 286-296 
jointly distributed variables, 183 
probability density operator 
canonical ensemble, 355 
grand canonical ensemble, 378 
harmonic oscillator, 306-308 
mixed state, 304 
pure state, 304 
spin system, 308-309 
time evolution, 305, 590 
probability distribution function, 178, 
231-233 

continuous variables, 179 
discrete variables, 178 
jointly distributed variables, 183, 
231-233 

time dependent, 231-233 
projection operators, 640-642 

quantum liquids ( see helium liquids) 

radial distribution function, 488-491 
argon, 491 

compressibility equation, 510 
experimental properties, 490-491 
internal energy, 490^492 
Percus-Yevick equation, 511-513 
pressure equation, 508 
radio-active decay, 281 


random variable, see stochastic variables 
random walk, 194-197 
binomial distribution, 194-197 
Markov chain, 240-241 
one-dimension lattice, 203-207 
self- similar, 221 

two and three dimensional lattices, 
203-207 

rate equations (chemical reactions), 79, 
668-670 

Schlogl chemical model, 733 
Brusselator chemical model, 736-737 
rate of reaction, 668 
Rayleigh-Benard system, 722, 742-752 
boundary conditions, 746 
linear stability analysis, 747-751 
nonequilibrium steady states, 744 
soft-mode transition, 749 
Rayleigh peak (scattered light), 561, 
599-600 
experiment, 600 
Rayleigh number, 748 

critical Rayleigh number, 750 
Rayleigh-Pearson random walk, 218-221 
Rayleigh-Schrodinger perturbation 
expansion, 690-691, 697-700 
reactant, 667 

radial distribution function, 489, 509 
reduced density matrix, 314-316 
time evolution, 316 
Wigner functions, 315-319 
reduced probability densities, 
classical phase space, 311-313 
reduced variables, 111, 116 
refrigerator, 26 

regression of fluctuations, 555 
relative coordinates, 672 
relative thermoelectric power, 604-605 
relevant eigenvalues, 444 
renormalization theory, 377, 440-462 
renormalization transformation, 441-444 
fixed points, 442-444, 458 
eigenvalues, 442-443, 458 
relevant eigenvalues, 444, 458 
S 4 -model, 458 
triangular planar lattice, 447 
representations of operators, 775 
resistance 

of mercury, 118 
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resonances, 327-334 
response (to external fields) 
constant field, 567-568 
general definition, 562 
microscopic, 591 

response function-matrix (dynamic) 
Brownian particle, 571-574 
causality, 562-567 
dynamic susceptibility, 563-567 
general definition, 562-563 
microscopic, 591 

response function-matrix (static), 10, 12, 
50-55 

Einstein fluctuation theory, 350-354, 
553-557, 568-569 
fluids, 353 
ideal gas, 54-55 
mechanical, 53-54 
thermal, 50-53 

response matrix (see response matrix) 
retarded solution, 595 
reversible change, 22 
Riemann zeta function, 386 
rotating systems, 586 
rotation invariance (of tensors), 619 
rotational symmetry (see symmetry) 
Rushbrook inequality, 139 

S 4 model, 448-462 
critical exponents, 460-461 
Sackur-Tetrode equation, 35, 348, 361 
sample space, 175 
saturated vapor pressure, 106 
scaling, 427, 433-440 
Kadanoff scaling, 428, 437-440 
Widom scaling, 427, 434-437 
scattering angle, 674 
scattering phase shift, 524-525 
scattering theory, 523-526, 671-679 
Schlogl chemical model, 722, 732-734 
Schrodinger equation, 522-523 
Schrodinger picture, 295, 305 
Schwartz inequality, 646 
second law of thermodynamics (see laws 
of thermodynamics) 
second sound, (see sound) 
second virial coefficient (see virial 
coefficients) 

Seebeck effect, 603-604 


self-diffusion coefficient (see diffusion 
coefficient) 

shear viscosity (rj) (see transport 
coefficients) 

Slater determinant, 779 
solids, 

Debye theory, 364-368, 406 
Einstein theory, 345-346, 362-364 
equation of state, 19 
solution (of particles), 75-78 
Sonine polynomials, 702 
Soret coefficient, 583 
sound 

classical fluid (hydrodynamic), 546, 
560 

dispersion relation, 364 
first sound (superfluid), 637-638 
fourth sound (superfluid), 654 
light scattering from, 593-600 
second sound (superfluid), 638 
solids, 364-365 
speed, 546 

spectral density matrix (see spectral 
density function-matrix) 
spectral density function-matrix, 
Brownian motion, 254-257 
broken symmetry, 644-647 
density fluctuations, 561 
dynamic correlation function, 558 
dynamic structure factor 

density fluctuations, 559-561, 598 
fluctuation theory, 558-559 
hydrodynamic poles, 647 
memory function, 642 
microscopic theory, 592, 639-647 
conserved quantities, 642-643 
projection operators, 640-642 
scattered light, 598-599 
transport coefficients, 643 
Weiner-Khintchine theorem, 558 
white noise, 257 
spherical harmonics, 523 
spontaneous process, 22 
square- well potential, 501 
stability, 

binary mixtures, 62, 154-155 
classical fluid, 104, 1 1 
chemical, 61 
Gibbs free energy, 63 
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stability ( contd .) 

Helmholtz free energy, 64 
global, 130 

linear stability theory, 791-795 
magnetic system, 65 
mechanical, 61 

nonequilibrium steady states, 722 
near equilibrium, 723-726 
far from equilibrium, 726-732 
nonlinear chemical reactions, 728-732 
thermodynamic, 57-65, 154-155 
thermodynamic stability theory, 
723-732 

standard deviation, 180 
binomial distribution, 190, 194 
random walk, 197 
state variables, 1 1 
state vector, 287 
stationary state, 322 
Liouville equation, 292, 306 
stationary stochastic process, 232, 255 
statistically independent fluctuation, 353 
Stefan-Boltzmann constant, 380 
Stirling’s approximation, 360 
stochastic hydrodynamics, 612-620 
stochastic variables, 177 
jointly distributed, 1 83 
stoichiometric coefficients, 78 
Stoke’s friction, 253, 620, 624 
Stosszahl-Ansatz, 680 
streaming current (membrane) 611 
stress tensor, 536, 541 
random, 612, 619 
structure factor (static), 
magnetic system, 430-432 
classical interacting gas, 490-492, 5 1 1 
argon, 491 

structure function (phase space), 299, 341, 
346 

classical gas, 299-300 
structure function (compressibility 
equation), 510 

sublimation curve ( see coexistence 
curves) 

superconductor, 118-122, 162-166, 
408^117 

experimental properties, 118-122 
Ginzburg-Landau theory, 162-166 
microscopic theory (BCS), 408-417 


supercurrent, 119, 164 
superfluid, 146-149 
fountain effect, 149 
He 3 , 123-125 
A-phase, 126 
B-phase, 126 
He 4 , 124, 146-149 
hydrodynamic equations, 631-639 
two-fluid model, 631-639 
surface tension, 20, 142-146 
table of, 143 

susceptibility (dynamic), 563-567, 568 
causality, 563-567 
power absorption, 570-571 
Brownian particle, 572-574 
microscopic, 592 
susceptibility (static), 430-432 
adiabatic, 53 
Ginzberg-Landau, 133 
Ising model (mean field 
approximation), 376 
isothermal, 53 
magnetic, 151, 429-430 
near critical point, 151, 431-432 
symmetries, 
gauge, 128 
rotation, 128 
translation, 128 

symmetrized states, 357, 359, 520-521 

Tchebycheff inequality, 198-199 
temperature fluctuations, 353 
temperature scale, 

Kelvin scale, 28 
Thomson scale, 27 
tension, 19-20 
tensors, 

unit tensor, 536 
mathematical properties, 542 
thermal conductivity (K) ( see transport 
coefficients) 

thermal expansivity, 53-54, 614 
thermal stability, 61 

thermal wavelength, 358, 360-361, 385, 
394 

thermocouple, 600, 653-654 
thermodynamic branch, 721 
thermodynamic equilibrium, 31, 40, 42, 
44, 46, 50, 55-57 76, 341 
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binary mixture, 155-158 
chemical, 80, 143 
entropy, 350 

liquid droplets and vapor, 142-146 
thermodynamic forces (see generalized 
forces) 

thermodynamic limit, 347-348, 356, 481 
thermodynamic potentials, 36-50 ( see 
also internal energy, enthalpy, 
Helmholtz free energy, Gibbs free 
energy, and grand potential) 
thermodynamic stability (see stability) 
thermoelectricity, 600-612 
thermomechanical effect, 146-149 
third law of thermodynamics (see laws of 
thermodynamics) 

Thomson heat, 605, 654 
Thomson temperature scale, 27 
three-body polarization forces, 514-515 
throttling process, 68-71 
time average, 298 
time-reversal invariance, 
detailed balance, 555 
hydrodynamics, 553-555 
transient random walk, 207, 216 
transition matrix, 235-236 
eigenvalues, 235-237 
eigenvectors, 235-237 
Master equation, 243 
spectral properties, 235-237, 243 
transition probability, 233-234 
transition probability rate, 242 
translational symmetry, 128 
transport coefficients, 531, 534 
bulk viscosity (£), 543 
diffusion coefficient ( D ) 

Brownian particle, 253 
Einstein diffusion coefficient, 250, 
581, 628 

experimental values, 581-582, 

584 

hard sphere gas, 704 
Lorentz-Boltzmann, 704 
mean free path theory, 663 
random walk, 196 
solvent-solute, 581 

diffusion coefficient (electrolyte) (D s ) 
586 

diffusional mobility (Ldd), 609 


electric conductivity (k), 584 
electroosmotic pressure-EOP (—Lpe/ 

Lpp ), 611 

experimental value, 612 
equivilent conductance (A), 584 
experimental values, 584 
hydraulic conductance (Lpp), 609 
kinetic viscosity 
longitudinal (u{), 549, 560 
transverse (v{), 549 
osmotic flow (Lpo), 609 
rotational, 629 

shear viscosity (rj), 250, 253, 593 
Boltzmann equation, 700 
experimental values, 706 
hard sphere gas, 709 
mean free path theory, 665, 666 
superfluid, 635 
water, 581 

streaming current-SC (Lep/L pp ), 611 
experimental values, 612 
superfluid viscosities (Ci, ( 2 , C3> ( 4 )* 
635 

thermal conductivity (X - ) 

Boltzmann equation, 700 
classical fluid, 543 
experimental values, 706 
hard sphere gas, 705-708 
superfluid, 635 

thermal diffusivity (x), 549, 560 
thermal diffusion coefficient (D T ), 
583 

ultrafiltration (L DP ), 690 
transport theory, 656 
triangular lattice (see Ising model) 
tricritical point, 123, 127, 151-153 
in He 3 — He 4 mixture, 127, 153 
triple point, 103, 127 
classical fluid, 103, 109 
water, 28 

Turing structures, 761-764 
two-fluid model (see superfluids) 

ultrafiltration (see transport coefficients) 
ultrasonic acoustic attenuation, 417 
uniform function, 645 
union of events, 176 
universality, 443 

Ursell functions, (see cluster functions) 
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van der Waals constants, 19 
van der Waals equation, 18, 67-68, 70-71, 
115-116, 139-141 
van der Waals gas, 18 
van’t Hoff’s law, 77 

vaporization curve {see coexistence curve) 
vapor pressure, 142 
equilibrium with droplets, 142-146 
variance, 180 

Brownian particle, 628-631 
binomial distribution, 190 
density fluctuations, 353 
energy fluctuations, 356 
particle number fluctuations, 
Bose-Einstein gas, 392 
Fermi-Dirac gas, 399-400 
temperature fluctuations, 353 
vector potential, 162, 595 
velocity autocorrelation function, 
Brownian particle, 252-254, 256-257 
hard sphere gas, 620-621 
long time tails, 620-631 
virial expansion, 712-716 
virial coefficients, 499 
Bose-Einstein gas 17, 520-526 
classical interacting gas, 17, 499-505, 
513-517 

cluster integrals, 499 
molecular dynamics, 506-507 
second virial coefficient (classical), 17, 
500-505 

experimental values, 504 
hard sphere gas, 505 
Lennard- Jones potential, 501-505 
square- well potential, 501, 504 
second virial coefficient (quantum) 
Bose-Einstein hard sphere gas, 
525-526 

ideal Fermi gas, 519-520 
hard-sphere gas, 505-507 
star function expansion, 500 
third virial coefficient, 17, 513-517 
argon, 516 


hard sphere gas, 516-517 
Lennard- Jones potential, 515 
square- well potential, 514 
three-body forces, 514-515 
Fermi-Dirac gas, 17, 519-526 
quantum gas, 17, 517-526 
virial expansion 
equation of state, 488, 519 
velocity autocorrelation function, 715 
divergences, 716 

viscosity {see transport coefficients) 
volume flow (across membranes), 
606-611 
vorticity, 621 

water (phases), 99 
Weierstrass function, 215 
Weierstrass probability density, 220 
Weierstrass random walk, 214-217 
Weyl correspondence, 317 
white noise, 251, 571 
spectral density, 257 
Wicks theorem, 187 
S 4 model, 454 

Widom scaling exponents {p, q), 434, 447, 
444 

Wiener-Khintchine theorem, 532, 
557-558 

Wigner functions, 286, 315-319 
Wilson theory of critical exponents, 428, 
440-462 

S 4 model, 448-462 
triangular planar lattice, 444-448 
work, 23 

X-ray scattering experiments, 488 

Yang-Lee theory of phase transitions, 
418-422 

zeroth law of thermodynamics {see laws 
of thermodynamics) 



